Gene Cag_1967 details

Gene Information

Locus tag: Cag_1967
Symbol:
ID: 3747829
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 2497036
End bp: 2499570
Gene Length: 2535 bp
Protein Length: 844 aa
Translation table: 11
GC content: 41%
IMG OID: 637774503
Product: hypothetical protein
Protein accession: YP_380258
Protein GI: 78189920
COG category: [S] Function unknown
COG ID: [COG4694] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
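
The coordinates and lengths in this record are mutually consistent, which the short Python sketch below checks. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence is unspliced and ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2497036
END_BP = 2499570


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count, assuming an unspliced CDS with one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_length // 3 - 1


gene_len = gene_length_bp(START_BP, END_BP)
print(gene_len)                     # 2535, matching "Gene Length"
print(protein_length_aa(gene_len))  # 844, matching "Protein Length"
```

Both results agree with the Gene Length and Protein Length fields, and the gene sequence listed under Sequence ends in the stop codon TAG, as the second assumption requires.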


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.972801
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTACTC ACCCAGATGC AATCGGGTAT ATTCGTGACC TCGGACATTC TGAGGGCAAA 
CTTTGGTTTG AAATGATCTG CGATCTCGCT GCCTATAGAA CCACGAATTT GAGTCCTACT
GATTTTGAAA TTCTGGTACA ATTATTTACT AAACAGGTAA GCTATCTTCG GCAACCTCCA
CCTATAATAG CCACAACAGT ATCTGTCGAA ACCGTTTTTC CATCATTCGA ACGCCTGGAA
ACCATTGGCC CATTTAATGG ATTTAAACGT CTTGGGAATT CGCTTGTGGC GTGCTTTCCC
AAGCGAGCCA CGATCATTTT TGGAGCAAAC GGAAGCGGAA AATCTAGTTT GTGTGAAGCC
CTGAAAATCT TGGCATCCAA TGATGCGCCC CGTCGACCTC TCCATGATGT ACATTGTTCA
ACAACTGTGA CGCCTTCGTT CAAATTCAAG TTTACTACTG ATACAACCGC TCAGACATGG
AGTTCAACGG ATACTTATGG CACACGCTTA AGTGTCATCA AATATTTCGA TACAGGAATT
GCGATTCACA ACATCAAGAA CTCTGTCCAA CCAGGTCGAA TTATTGAGCT TGCACCTTTC
AAATTAGATG TATTTGAGAT AGCAAAAAAT CACTGCGAAG TTTTACGCAC AGAACGTCAG
AAACGACAGC GCGAAAATAC CGACCAGCTA ACAATTGTTA TAGAACAGAT ACGTGCAAAG
TTTGAATCTT TTGAAGGCAC TATACTCGCT GGATTACAAA TATCTGCGAA GTCAATACTT
GAAGCTGAGA TCAAATTGGG GGAGAATTAC TCCGAGGAAA ACGGGTTGGA CGAGAAGTTA
AAAAAGAAGT CCGATCTTGA AAAAGCCACT TCAGAAGAGG GCATTAAATT ACTCAAGGGT
GAAATTGCTG CATTAAAAGC CTTGAACGCA GAAATTGATC CGATTCTTAC TGCATCCGAA
AAGCTCGTTG AAATTGATCC TGTTGCCAAG TCCAATAGTT TAAAAGATAA AGAAACCGAG
TTGAAAGTTC TTGCAGAAGC GCTAATCCCA TCCGGAGCAA CTCTGGATAA ATTGATGGAA
CTTATCCGTC CGGCAAACGA GATTAGCATT CTTAATTCTT CAGAGTTAGA AGAGTGCCCT
CTTTGTAAGC AACCACTTCA GACGCGGGAA CTTGAACTTT TCAAGCAATA CTACACTCTC
CTAACAGGAG AACTGGACGC TGCTATTACT GAACTTCGTA AAATTCTTAA AACGTCAGAA
AAGAACCTTA AGGTGATCTC CGACTCGACG CCTGATGAAT GGGCAAAGGG TTCTGTTCTG
TCCCAAGATT TAATTGATGC CATCAAAGAT TCAGGAAGAG CAATACAGAA ATATTTTAAG
TTTGGAGAGA ACATCAGCCA AAACTGTAAA GATGCAGCTG TTTTATTGCG AAAATTCAGC
ATTAAATTAT CAATAAAAAC AGAAGAGAAG GAAACGCTCA TTGATAAGTC AGGAAAAGAC
CGCGAAGAGC TTCTAAAGCA ATTGACGCAA ATTTCAAACG AATGCAAAAA GTTGCTTTAT
GCCAAGTGTA TCGCGGACAA TATGGATCTC GTTAAGAATG CGCATGGCAA AATGTTAAAT
GCTACCTTTT GGGATACTAA CCTGCCAAAC TTCTCACCTG TGCTACGAAA AATCACCTCC
ACAGCAAAGA AGGCTCACAA GGAGCTTGTA GTAGAAGATT TTAAAAATCG ATTAAATGCA
GAATATCTTG CGCTCGCTGA AAAGGATATG AGTGCTTTTG GGGTAGAGTT GAAGGATGTC
GGTAGCGATG TTGCTGTTAC TGTCGATCAT CATGTTGCAG GTCAAAGGAT TGAGTCGGTT
TTGAGTGAAG GTGAACAACG AATCCACGCC CTTGCCTTGT TCTTTGCGGA ATTGGAAACT
TGCGAGCAAC AAGTAATCGT ATTTGACGAT CCGATCTCCA GTTTCGACTA TAACTATATC
GGAAACTACT GCAATCGTTT ACGCGATTTG ATTCAACAGT ACCCAGATCG ACAGATCATC
GTTTTAACGC ACAATTGGGA ATTTTTCGTT CAAATTCAAA CAACGCTGAA TACAGCCCGA
TTAAACCAAC ATATGTCTGT ACATGTTTTA GAAAGCTGCG TCGCAATCGA TGAATACAGC
GAGAATATTG ACGAGTTGAA AACCAATATA GACGCGATTT TACTTGGTTC AGGAGAACCG
ACCAAACAAC AGAAAGAAGC AATGGCGGGG AAGATGCGCC GTTTAATTGA AGCTGTGGTT
AATACGCATG TATTTAATAA GCAACGGCAT CAGTTCAAAC AGAAAAATCA ACAAGTATCA
GCTTTCGACG ACTTTACTAA AGTAGTTCCA CTTCTTCTCT CAGAGGCTCA AACTTTACGA
GATCTTTTTT CTAAACTGAG CATCACAGAG CATGACGACC CACGAAATGC CTACGTCAAT
ACTGATAAAA GTATGTTTCT AACCCGATAC AATGCAATAA AGTCAATAGA AACTGCAATT
ATTGGAAGGA AATAG
 
Protein sequence
MPTHPDAIGY IRDLGHSEGK LWFEMICDLA AYRTTNLSPT DFEILVQLFT KQVSYLRQPP 
PIIATTVSVE TVFPSFERLE TIGPFNGFKR LGNSLVACFP KRATIIFGAN GSGKSSLCEA
LKILASNDAP RRPLHDVHCS TTVTPSFKFK FTTDTTAQTW SSTDTYGTRL SVIKYFDTGI
AIHNIKNSVQ PGRIIELAPF KLDVFEIAKN HCEVLRTERQ KRQRENTDQL TIVIEQIRAK
FESFEGTILA GLQISAKSIL EAEIKLGENY SEENGLDEKL KKKSDLEKAT SEEGIKLLKG
EIAALKALNA EIDPILTASE KLVEIDPVAK SNSLKDKETE LKVLAEALIP SGATLDKLME
LIRPANEISI LNSSELEECP LCKQPLQTRE LELFKQYYTL LTGELDAAIT ELRKILKTSE
KNLKVISDST PDEWAKGSVL SQDLIDAIKD SGRAIQKYFK FGENISQNCK DAAVLLRKFS
IKLSIKTEEK ETLIDKSGKD REELLKQLTQ ISNECKKLLY AKCIADNMDL VKNAHGKMLN
ATFWDTNLPN FSPVLRKITS TAKKAHKELV VEDFKNRLNA EYLALAEKDM SAFGVELKDV
GSDVAVTVDH HVAGQRIESV LSEGEQRIHA LALFFAELET CEQQVIVFDD PISSFDYNYI
GNYCNRLRDL IQQYPDRQII VLTHNWEFFV QIQTTLNTAR LNQHMSVHVL ESCVAIDEYS
ENIDELKTNI DAILLGSGEP TKQQKEAMAG KMRRLIEAVV NTHVFNKQRH QFKQKNQQVS
AFDDFTKVVP LLLSEAQTLR DLFSKLSITE HDDPRNAYVN TDKSMFLTRY NAIKSIETAI
IGRK