Gene Cag_1765 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1765 
Symbol 
ID3746625 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2281429 
End bp2283219 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content48% 
IMG OID637774302 
Productpeptide ABC transporter, periplasmic peptide-binding protein 
Protein accessionYP_380059 
Protein GI78189721 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.220703 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGATGA ATAGTATTCT CAAAAAAACC GCCACGTTTC GTCGGTTGGT GGGCGTGGTG 
GCAACGACGG CGTGTACGCT TTTAACGGCG TGCGGTGGTA GCAATAGCAA ATCTACAGCA
CAAGTGGGCA CGCCAGCCAT GGATTCCACC TTGGTGATTG CCATGCTGGG CGATGCCGAT
TATCTCAATC CTGTGCTTGC GGGTACAGTA ACCTCAAGCA ATATTGTTGG GCTTATGTCG
CCATCGCTGT TGCAAAGTGA GTTTGATACC ACTACGGGGT TGTTGAATTA CATGGCGCTT
GAAAAAAAGC TTCGGGCGGG CGGGAGTAGC AGCAAAATGC CACAAGGTGC GGTAGCAAAA
AGTTGGACTA TGTCGGCGGA TCACACAACG CTTACCTACA CCTTGCGAAG CGATGCTTTT
TGGAGCGATG GCAAACCTGT GGTTGCTGAG GATTTTAAGT TTACCTATCA ACTCTATGGC
AACCCGTTGA TTGCAAGTGC TCGCCAACAA TATCTTGCTG AACTTGTTGG TGCCGATAAA
GGGCAAATTG ATTTCAACCG TGCCATTGAA GCACCGAACG ATACCACCCT TATTTTCCGC
TTTTATAAAC CCGTTCCTGA ACATTTAGCG CTCTTTCATA CCTCACTTGC ACCCTTGCCT
GCCCATCAAT GGAAAGGGGT AAAAGCTGAG GAGTTCCGTC AGTCGCCGCT AAATTTGCAG
CCACTTTCCG CCGGTCCATA TCGCTTAACG CGCTGGACGC AGCAGCAAGA AATTGTGCTG
GGGGCAAATC GCACCTCTAC GCTGCCAAAG CCGGGCAACA TTCCCACCCT AAGCTTTCGA
GTGGTGCCCG ATTACACCGT CCGCTTAGCG CAATTACAAA CGGGTGCGGT TGATGTGGTG
GAAAATATTA AGCCCGAAGA TTTTTCGACC CTTAAGCAAG CGAAGCAACC GGTTGATATT
AAAACACTTG GGCTTCGTGC TTACGATTAC ATTGGGTGGT CAAATATTGA TTTTACGGAG
TACCGTAAAA ATCATCGCAT AAAGCCGCAT CCGCTCTTTG GCTCACCAAC AGTACGCCTT
GCCTTAACGC AAGCAATTGA TCGTGAAGCT ATTATTGATG GCTACTTGCG CGAGTATGGC
GTTGTGTGCA ATACCGATAT TTCGCCATCC TTAAAATGGG CATACAACAA CAAAATTACC
CCACACCCTT ACGATCCTGC AAAAGCTAAA GCGTTGCTTG CGGCTGATGG ATGGAAGTAT
GGCGCTGACG GAATTTTGCA AAAGCAAGGC AAGCGTTTCA GTTTTGTGCT ACATACCAAT
GCTGGCAATG CGCGTCGCAA CTATGCAAGC GTTATTATTC AGCAAAATTT GCGCGCCATT
GGTATTGAGT GTAAGCTTGA GGTGCAAGAG TCAAACGTCT TTTTTGAAAA CCTTCAACAG
CGCAAGCTTG ATGCGTGGCT TGCAGGATGG GCAATTGGAT TAGAAATTGA TCCGCTTGAT
ACATGGGGAA GTAATCTTGA AAAGAGCCGC TTTAACTTTA CGGGCTATCA AAATCCCCGT
ATTGAACAAC TTTCAGCGCT TGCCAAGCAA AAAATGGAAC CAACGGGCGC TCGCCCCTAC
TGGCTTGAAT ATCAAGAAAT TCTCCATCGC GATCAGCCCA TTACCTTTTT GTATTGGATG
AAAGAGACGC ATGGTTTTAG CCGCCGTATT CAAGGCGCAG AGCTGAATAT TGCTGGTGCT
TTTTATAACC TTGATGATTG GAAGTTGCAG CCATCAGCTT CCATCCAATA A
 
Protein sequence
MVMNSILKKT ATFRRLVGVV ATTACTLLTA CGGSNSKSTA QVGTPAMDST LVIAMLGDAD 
YLNPVLAGTV TSSNIVGLMS PSLLQSEFDT TTGLLNYMAL EKKLRAGGSS SKMPQGAVAK
SWTMSADHTT LTYTLRSDAF WSDGKPVVAE DFKFTYQLYG NPLIASARQQ YLAELVGADK
GQIDFNRAIE APNDTTLIFR FYKPVPEHLA LFHTSLAPLP AHQWKGVKAE EFRQSPLNLQ
PLSAGPYRLT RWTQQQEIVL GANRTSTLPK PGNIPTLSFR VVPDYTVRLA QLQTGAVDVV
ENIKPEDFST LKQAKQPVDI KTLGLRAYDY IGWSNIDFTE YRKNHRIKPH PLFGSPTVRL
ALTQAIDREA IIDGYLREYG VVCNTDISPS LKWAYNNKIT PHPYDPAKAK ALLAADGWKY
GADGILQKQG KRFSFVLHTN AGNARRNYAS VIIQQNLRAI GIECKLEVQE SNVFFENLQQ
RKLDAWLAGW AIGLEIDPLD TWGSNLEKSR FNFTGYQNPR IEQLSALAKQ KMEPTGARPY
WLEYQEILHR DQPITFLYWM KETHGFSRRI QGAELNIAGA FYNLDDWKLQ PSASIQ