Gene CA2559_10733 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCA2559_10733 
Symbol 
ID9297635 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCroceibacter atlanticus HTCC2559 
KingdomBacteria 
Replicon accessionNC_014230 
Strand
Start bp2340080 
End bp2342041 
Gene Length1962 bp 
Protein Length653 aa 
Translation table11 
GC content30% 
IMG OID 
Productcapsular polysaccharide biosynthesis protein CapD 
Protein accessionYP_003716890 
Protein GI298208711 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTTATA GTGAAAATAA GATTAATGTT CGTAATATTA AATATCTTCC AAGATGGGTT 
GTATTAATAA TTGATACATT ACTTATTTGT ATTTCTTTAA TTGTATCAAT AATAATACTT
AATAGAATTA GAATAAGTCC TTTGGATGTT TTAGAATATT CCATTCAAGC ATCTATTGTA
ATTTCTGTTA ATGTAATCTT CTTCTTTTTA TATAAAACAT ACTCAGGACT TATAAGACAC
TCTTCAATGA TGGATGCATT AAAGCTGTTT TTAGCTTCAA TGAGTACCAC AGTTGTATGC
CTTATGTTAA ATGTATTTTA TCAAGGCACT TATGGTGATA AGATTTTTAT TACTACCGGT
CTTATAATGT ATTCATTTAT ATCATTTTCG ATTTTATTTT TATTTAGGAT AGCAATTAAA
CAACTCTATG AGTTTTTTAA GATAGCACAA CGTGAAGAAG ACCTAATTGA TGTAGCAATT
CTCGGAATAG ATGAAAATGC AATATCTATT GCTGCAGCAT TAGAAATGGA GCACCCTAAA
CGTTTTCTAG TAAAGGGATT TGTGTCTAGA AATAACTTTA AAAGACAGTT AAGAATATTA
GAAAAGCCTG TTGTTAGAGT AAAGGAGTCT ATTTTACCAT CATTAAGAAT ATTAAATGTA
AAGGCTATAA TAATTACTTC TGCTATAGAC CCAAAAGAAA AGCAGCAAAT AGTAGATGAG
TGTTTAGAGA ATGACATTAA GATGTATAGT TCTCCAATCT TATCAAATTT TGAGGAAGGT
AATTCTCCAC AAATACAAAA GCTGCAAATT GAGGATTTAT TAGAAAGAGA GCCTATTGTA
TTAAATAAAA AAAATAAAGA ACATCAATTA AAAAATAGAA CTGTCTTAGT AACTGGTGGA
GCAGGATCTA TAGGTAGTGA AATTGTTAGG CAAGTAGCAG AATACAAACC TAAGATGTTA
TTAGTATTAG ATCAAGCAGA ATCTCCTTTG CATCAAATAC AACTGGAGAT AAATGAGCAC
TTTCCTAACC TTAAGTACAA ATGTATAATT TGTGATGTTA CTAACAACGT TCGCTTGCAA
AGTGTTTTTG ATGAGTTTGA TATTGATGTT ATTTATCACG CAGCGGCATA TAAACATGTG
CCACTTATGG AGAACAACCC TCATGAAGCT ATTTTAACAA ACATTCTTGG AACAAAACAA
GTTGCAGATT TAGCTGCAAA GTTTAAAGTA GGTCACTTTG TAATGGTATC TACAGATAAA
GCAGTAAACC CTAGTAATGT TATGGGAGCT TCTAAAAGAG CTTCAGAAAT GTATGTGCAA
TCTCTAAATT ATAACTTACA ACTGAAAGAT AGATGTGCTA CAAAATTTAT AACTACAAGA
TTTGGTAATG TTTTAGGATC TAATGGATCT GTTGTCCCTT TATTTAAAAA ACAAATTGAA
GAAGGCGGAC CAGTTACCAT CACACATCCA GATATAATTA GATATTTTAT GACAATCCCA
GAAGCCTGTC AATTAGTTTT GGAAGCTGGT GCTATGGGTA AAGGTGGTGA GATATTTATT
TTTGATATGG GTAAACCTGT CAAGATTATG GATTTAGCCA AGAAAATGAT AAAGTTGGCT
GGATTTATAC CAAACAAGGA TATACATATT AAAATAACTG GATTAAGGCC AGGAGAAAAG
TTATACGAAG AACTGCTATC AGATGAAGCT AAAACTCTAC CTACACATCA TGAAAAAATT
ATGATTACTA AAGATAGAAA TGATAACTAT GAGTTTATCT CAAGCAGTAT AGATAGCGTT
ATTAATTCTG CAAATAATCA TAATAATGTG CGTGTGGTTC ACAAGCTTAA AAAAATGATT
CCAGAATTTA AAAGTAAGAA TTCATCTTAT GAGTCATTAG ATGCAGAGGT GTCGTTTAAC
GAGATAATAA ATGAGGATAC CGTTAAAATA AAAAAGCTTT AA
 
Protein sequence
MSYSENKINV RNIKYLPRWV VLIIDTLLIC ISLIVSIIIL NRIRISPLDV LEYSIQASIV 
ISVNVIFFFL YKTYSGLIRH SSMMDALKLF LASMSTTVVC LMLNVFYQGT YGDKIFITTG
LIMYSFISFS ILFLFRIAIK QLYEFFKIAQ REEDLIDVAI LGIDENAISI AAALEMEHPK
RFLVKGFVSR NNFKRQLRIL EKPVVRVKES ILPSLRILNV KAIIITSAID PKEKQQIVDE
CLENDIKMYS SPILSNFEEG NSPQIQKLQI EDLLEREPIV LNKKNKEHQL KNRTVLVTGG
AGSIGSEIVR QVAEYKPKML LVLDQAESPL HQIQLEINEH FPNLKYKCII CDVTNNVRLQ
SVFDEFDIDV IYHAAAYKHV PLMENNPHEA ILTNILGTKQ VADLAAKFKV GHFVMVSTDK
AVNPSNVMGA SKRASEMYVQ SLNYNLQLKD RCATKFITTR FGNVLGSNGS VVPLFKKQIE
EGGPVTITHP DIIRYFMTIP EACQLVLEAG AMGKGGEIFI FDMGKPVKIM DLAKKMIKLA
GFIPNKDIHI KITGLRPGEK LYEELLSDEA KTLPTHHEKI MITKDRNDNY EFISSSIDSV
INSANNHNNV RVVHKLKKMI PEFKSKNSSY ESLDAEVSFN EIINEDTVKI KKL