Gene Cag_1809 details


Gene Information

Locus tag: Cag_1809
Symbol:
ID: 3746924
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 2330964
End bp: 2332094
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 49%
IMG OID: 637774347
Product: hypothetical protein
Protein accession: YP_380103
Protein GI: 78189765
COG category:
COG ID:
TIGRFAM ID:
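
The start, end, and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the protein length excludes the stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2330964
end_bp = 2332094

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # compare with "Gene Length"
print(protein_length)  # compare with "Protein Length"
```

Under these assumptions the computed values agree with the 1131 bp and 376 aa reported in the record.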


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.895599
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCATCAA TCGCCCCCAT AATTCCATTC TTGCTGCTCT TTTTGTGGCT TTTGCAAGGG 
TGCGCTTCCG ATAGAGCACC ATCAGGCGGT AGTGCCGATA CCACGCCTTT GCGCTTATTA
GCATCAACTC CCATAAATGG CACACAAAAT TTTAAAGGCA ACCAGCTTCA GCTTTACTTT
AGCCACGAAG TGAGCAGTCG TGCACTGTTA CGCGCACTTC GCACATTCCC TGATATAGGA
CAATTTGAAC TGACGGTAAA CGGCAAACGC GCCGATATTC AGTTACTGGA TACTTTACAA
GCCAACCAAA CCTACACCTT GCTGCTGAAT CGCCACTTGA ACGACTTTCG TGGGCAGTTG
CTCCACGCGC CAACCACTCT TGCGTTTTCA ACAGGCAACA ATGTGAATAA TGGCACCATA
CGTGGCACAG TAGTACAGTA TAACGGCACA CCAGCCAGCA ACGCTTTACT CCTTGCCTTT
GCAAGCGCCG AAAAAGGGGC AACGGTTAAT TTGTTGGAAA ACAAGCCAAC ACAGATAGCA
CAATGCGATG CTTCGGGCAG CTTTGCGTTT AACCATTTGC CGCATGGCAG CTACCACGTA
GTAGCCATCA ACGACCGCAA CCACGACCTT GCATGGGCAC CAAGCAGCGA AGAGTACGCC
ACTCCAAGCC AGCCACTTAT GGCAACAAAC AGTGCAAACC AACTATTGCG CCTTTCGCCT
CCACTTAAAA GCCCAAAGCC GCTCAAAATC CCTTTGGAAG CCTCTTCAGC CCCAACAAAT
TCAACGATTG CAACAGGTAG CCTGAGCGGC ATGTGTACGG TACGTGGCAA TCCACCAAGC
GTAATTATTG AAGCTATCTC GCCATCAGCC ACTTACTACA CCGTTGCGGT GCGCAAAAAA
GCAGGCAGCT ACACCTACCA TTTTAACCAA TTACCCGTTG GAGACTACAC GATTACCGCT
TCCATTCCAA CCGCAAGCTA TCAGCCAAAC CAAGCATGGC AATGGAATGC AGGTTCAGTA
GCACCTTTTG TGCCCTCTGA TAGTTTTACC TTTTATCCTG AAACCGTTAC CATTCGAGAA
GAGTGGCTTA CCGAACGCAT TAACATTACC TTTCCTACCA TTTTACAGTA A
 
Protein sequence
MPSIAPIIPF LLLFLWLLQG CASDRAPSGG SADTTPLRLL ASTPINGTQN FKGNQLQLYF 
SHEVSSRALL RALRTFPDIG QFELTVNGKR ADIQLLDTLQ ANQTYTLLLN RHLNDFRGQL
LHAPTTLAFS TGNNVNNGTI RGTVVQYNGT PASNALLLAF ASAEKGATVN LLENKPTQIA
QCDASGSFAF NHLPHGSYHV VAINDRNHDL AWAPSSEEYA TPSQPLMATN SANQLLRLSP
PLKSPKPLKI PLEASSAPTN STIATGSLSG MCTVRGNPPS VIIEAISPSA TYYTVAVRKK
AGSYTYHFNQ LPVGDYTITA SIPTASYQPN QAWQWNAGSV APFVPSDSFT FYPETVTIRE
EWLTERINIT FPTILQ
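
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It is not the method the database used. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the codon table is written out by hand on the assumption that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in their permitted start codons, which this sketch does not model).

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above (60 nucleotides, 20 codons).
first_line = "ATGCCATCAA TCGCCCCCAT AATTCCATTC TTGCTGCTCT TTTTGTGGCT TTTGCAAGGG"
print(translate(first_line))  # MPSIAPIIPFLLLFLWLLQG
```

The printed peptide matches the first 20 residues of the protein sequence shown above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.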