Gene Cag_1866 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1866 
Symbol 
ID3747018 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2374880 
End bp2375887 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content39% 
IMG OID637774403 
Productputative DNA-binding protein 
Protein accessionYP_380159 
Protein GI78189821 
COG category[R] General function prediction only 
COG ID[COG3943] Virulence protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAATGA AAAATAATCA ATCATCCTTT ATTCTCTTTA CAACCGAAGA TGCAAAAATA 
GCGGTAGATG TTCGCTTTGA GGAAGAAACA GTATGGCTTA CGCAAGAGCA AATGGCAGTT
TTATTCGGCA AGGCACGCAC AACCATTACC GAACACATAC AAAATGTTTT TAAAGAAGGT
GAGTTAAACG AAGAAGTGGT GTGTCGGAAT TTCCGACATA CCACTCAACA TGGAGCAATT
GAGGGGAAAA CACAAGAAAC ATGGGTAAAA CATTATAATT TAGATGTTAT TATCTCTGTG
GGCTATCGCG TAAAATCTTT GCGAGGAACG CAGTTTCGTC AATGGGCAAC CAAACGCCTG
AACGAATATA TCCGCAAAGG TTTTACTATG GACGATGAGC GCCTTAAAAA TATAGGCGGT
GGAGGTTATT GGAAAGAATT GTTGCAACGC ATACGCGACA TTCGTGCATC CGAAAAAGTA
TTCTATCGCC AAGTGCTTGA TATTTACGCA ACCAGCATTG ATTACGACCC TAAAGACGAG
GTTTCGCTTG CATTTTTTAA AAAAGTGCAA AACAAAATTC ATTATGCAGT ACATGGTCAA
ACAGCAGCCG AGCTTATTTT CAATCGTGCC GATGCCGAAA AAGATTTTAT GGGTTTAATG
ACTTTCTCAG GCAGTCGTCC CTACTTGAAA GATGTGGTGG TGGTAAAAAA TTATTTAAAC
GAAAAAGAGC TTCGCGCACT TGGGCAAATA GTTTCAGGAT ATCTTGATTT TGCTGAACGA
CAAGCGGAGC GAGAACAAGC AATGACTATG AAAGATTGGG CGGAACATTT GGATAGAATT
TTAACTATGA GCGGTGAAAA GCTGTTGCAA GAAGCAGGCA CGATAAGCCA TGAAAAAGCG
GTAGAAAAAG CTACTACAGA GTATAAAAAG TATCAGCAAA AAACATTGAG CGAAGCAGAA
TATAATTACT TTGAAAGTTT GAAAATTTTA GAAAGCAAAA TTCATTAG
 
Protein sequence
MQMKNNQSSF ILFTTEDAKI AVDVRFEEET VWLTQEQMAV LFGKARTTIT EHIQNVFKEG 
ELNEEVVCRN FRHTTQHGAI EGKTQETWVK HYNLDVIISV GYRVKSLRGT QFRQWATKRL
NEYIRKGFTM DDERLKNIGG GGYWKELLQR IRDIRASEKV FYRQVLDIYA TSIDYDPKDE
VSLAFFKKVQ NKIHYAVHGQ TAAELIFNRA DAEKDFMGLM TFSGSRPYLK DVVVVKNYLN
EKELRALGQI VSGYLDFAER QAEREQAMTM KDWAEHLDRI LTMSGEKLLQ EAGTISHEKA
VEKATTEYKK YQQKTLSEAE YNYFESLKIL ESKIH