Gene Cag_1895 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1895 
Symbol 
ID3746794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2406533 
End bp2409610 
Gene Length3078 bp 
Protein Length1025 aa 
Translation table11 
GC content47% 
IMG OID637774432 
Producthypothetical protein 
Protein accessionYP_380188 
Protein GI78189850 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTACTGG GTGAGATGAT GATACAAGGT GTAGCAATGT CGGCATATAG CTGGTCGGGC 
TATAATGGTA CGGGCTACGA GAGTTTATTG CAGCAGGCGG TGGAGGTTGG GGCTACCTCA
GTGCTACTTG GTAGTGTTTC AATTATTGAC CTCAATAATG GAGCGGTTAG CGCGTGGGTG
CGTGATGATG GTTTTACCAC CACCGCAAGC ATGGGGGATG TTGAAGCGGC TATTCAACAA
GCGCAAGCGC ATGGTTTGCA GGTTTTTTTA AAGCCGCAAA TCCACTCCTA TAATCCAGCA
TCTGCTGCTT TTGGTGGTAA TCCATACAAC AATCTTATAA ATCCCGATCC GAGTAATCCG
CTTATTATTC CTAATCTCGA TCTCTTTTTT GAGGGCTATA AAGCCTATAT CGTAGAGTGG
GCGGAGCTTG CAGAGCGTTA TCAGGTGCCG CTCTTTAGCG TTGGGAATGA AATGGTGGCG
GTTACCTCGG CTGAGTTTAC GCCCTATTGG GAGGATATTA TTGCAAGTGT GCGCAATGTG
TATCATGGGC AGCTTACTTA TGCAGCAATG ACTGATGTGA AGTGGGATTC GAATGATGAG
GTATCGCACA TTGAATTTTG GGATAAGCTT GATTATGTGG GCGTTGATAT GTATCCCGAT
TTTGATACCG GTGCAACAAT CCCTACAACG CCAACCGTTG AGCAGCTTAA TGATATTTGG
GTAGAGCAAA AGTGGCAAAG CTATTTAAGT GCTATCGCCG AAGCAACGGG TAAGCCGCTT
CTTTTTACTG AAACCGGGGT GGCAAGCTTT TTGGGTGGAG CGAATCGTAG TCGTTATACC
GATGCGCTGA TTAGCCAAAT GGGCACTGTG CGTGATGATG CAACGCAAAC CAATTGGTTC
CAAAGCTTTG CTGAAACGTG GATGGGTGAG AACCAACCTG AGTGGTTTGG TGGCATGTAC
TTTTGGAATA ACGACCCTCC ATATAATGCA GGTTTACAAG ATATCACAGG CTATACCTTT
TTTGGTAAAC CTGCTGAAGT AGTGGTTAGT AGCCTTTTTG ATGCGGTCAA TAGCCTCGAT
TTTGATCAAA CACTTTTTCT TGCCAGCGAT AGTGATGACC GCATTGCTCT CTACAAATAT
ATTGCTGAAG CTGATGCAAA TCCGTTGACG CGAGCGCAAA GCTATCATTC CACCGTTATT
ATTGAGCTAA ACGGCACTAT TCTTGAAGGT GCTGAAGCGG TAACACCGAC CATCCATTTT
TATCTTAATG GCAAAGATTA TGGCGCTGTT ACGCTGAGCA ATGTTGAAAG TGAGTATAGC
ATAAATAAAG GAATTGAAGC TGCAAGTAAA GGTGAGGCAT ATTATCCTCA TAGCACGCTT
ATCCCATTTC TTTTTGAGAT AGATGAATTA GAGGTTCGCG ACATTCACAT TGTGCGCGAT
TCCGTGCAAG TGGAAAACAG CGAAGTGTAT ATCAGCCGTG TAACCATTGT GCCCGATATG
GGTGCGGCAA CGGTTAATAC AACGGTTAAT AGTTTACAAA ATGCGTGGCT TGCTTTTGAG
GAACCATCCC AAGCATGGGG TGGCGCAACA GGATATCAAT TTCCAAACGG TGCCATTCCT
TACGATGTAC CATCCGTTAC CATAGATACT TCACCCTATA AAAAAACGCT TGCCACCATG
AGCGGTACGC CTGACAATCC CATTACTGTT AAAGGATACG AAGGCTTCGA CACGGTTTAT
TTATTGGGTT CACCTGAGCA ATACACCATT ACTCTTGAGG GCGATATGCT CATGGTAGCC
GAAAGTAGTG GCTTAGGGCA AAATAGTCAG CTTGGCGGTG TAGAGCGCTT GCTTTTTGCG
GAGGCTGATT ATGCGCTGCT TTTTGGGGGT ATGGGGAATG ATACGCTGTA TGGTGGCGCT
GGCAACGACC GCTTTAACGG TGGTGATGGC AATGATGTCG TTTTATTAAG CGGCTATGCT
ACCGAATACG AAGTAAGCGA CAATGAAGCG AGTGCAACTT ATACCATTAC CGATAGTGTT
GCAGGACGCG ATGGCAGCTA TCAACTTAGT AACATGGAGG CGTTGCAATT TGGAGCATCG
CCAATGCAAT GGAATCTTGA AGAGTTTCGG GCGGCGCTTG CAGCCTCGCA ACTTCTCCCG
CAACAAGAAG AGCCATTCAA TGTAACAGGC TCAGTTACCT TTTGGAAAAA TGGTGCAGCA
ATCAGCAATG TGGCTACAAC GCTCTCGCTG CATTCGGTTA CCAACAATGG CGAAGAGCTG
CTCTTTCAAC ATTTGCAACA TCATGCTGAT GGAGGCTATA GTGTTGAAGT GTGGGCAAAT
GCGACGGATG CGCTGCATAG CCTCCAGTTC GAGTTCCAAC TACCAACAAA TGCGCAAGCT
GCGTGGCATT TTAGTGAAGA GGTGCCACAA GGCTGGCAAA CGGGGGTTAA TAATCAAGGT
GCTGATGCGT TGCTTATTGG TGGCATGGGC GCTACCGCAT TGCCGTCGGG ATTGGTGCAG
CTTGGCACCT TGAGCTTTGT GGCTCCGACT GATGCTGATC GGCTTGAAAT AGCGCTGACA
AGAGGTGAAC TTGGCAAGCA ATGGCTTGTT CCAGCAACCA TTACGCTTGA AAGTAATGTG
CTTGCAAGCA ATGGCGGTTA TCAGCATAAT GCACTATGGC AAGGTAGCTA CCATTTAAGC
GTGCAGCATG AAAGTACTGA AGAGCCAACC AACATGGTTA CCATGAGCGA TGCGTATGCC
GCGTTACAGA TAGCCGCAGG GCATAATCCC AATGAGTCTG AAGCGCCACT GCAATCATGG
CAATTCTTGG CTGCTGATAT AAATCGTGAT GGCAAGGTTC GTGCATCCGA TGCGCTTACC
ATTTTAAAAA TGGCGCTTAA TTACCACGAT GCTCCAAGTG AAGAGTTGAT TTTTCTGCCC
GAATGGGTGG GCAAGAGCGA GATGACGCGC AGTTCAGTTG ATTGGTCAGC GACTGAAATA
ATGCTTGATG TTGAAAATTA TCAAATTGTT AACCTTATTG GGGTTATTCA AGGTGATGTT
GACGGCAGTT TTAGTTAA
 
Protein sequence
MLLGEMMIQG VAMSAYSWSG YNGTGYESLL QQAVEVGATS VLLGSVSIID LNNGAVSAWV 
RDDGFTTTAS MGDVEAAIQQ AQAHGLQVFL KPQIHSYNPA SAAFGGNPYN NLINPDPSNP
LIIPNLDLFF EGYKAYIVEW AELAERYQVP LFSVGNEMVA VTSAEFTPYW EDIIASVRNV
YHGQLTYAAM TDVKWDSNDE VSHIEFWDKL DYVGVDMYPD FDTGATIPTT PTVEQLNDIW
VEQKWQSYLS AIAEATGKPL LFTETGVASF LGGANRSRYT DALISQMGTV RDDATQTNWF
QSFAETWMGE NQPEWFGGMY FWNNDPPYNA GLQDITGYTF FGKPAEVVVS SLFDAVNSLD
FDQTLFLASD SDDRIALYKY IAEADANPLT RAQSYHSTVI IELNGTILEG AEAVTPTIHF
YLNGKDYGAV TLSNVESEYS INKGIEAASK GEAYYPHSTL IPFLFEIDEL EVRDIHIVRD
SVQVENSEVY ISRVTIVPDM GAATVNTTVN SLQNAWLAFE EPSQAWGGAT GYQFPNGAIP
YDVPSVTIDT SPYKKTLATM SGTPDNPITV KGYEGFDTVY LLGSPEQYTI TLEGDMLMVA
ESSGLGQNSQ LGGVERLLFA EADYALLFGG MGNDTLYGGA GNDRFNGGDG NDVVLLSGYA
TEYEVSDNEA SATYTITDSV AGRDGSYQLS NMEALQFGAS PMQWNLEEFR AALAASQLLP
QQEEPFNVTG SVTFWKNGAA ISNVATTLSL HSVTNNGEEL LFQHLQHHAD GGYSVEVWAN
ATDALHSLQF EFQLPTNAQA AWHFSEEVPQ GWQTGVNNQG ADALLIGGMG ATALPSGLVQ
LGTLSFVAPT DADRLEIALT RGELGKQWLV PATITLESNV LASNGGYQHN ALWQGSYHLS
VQHESTEEPT NMVTMSDAYA ALQIAAGHNP NESEAPLQSW QFLAADINRD GKVRASDALT
ILKMALNYHD APSEELIFLP EWVGKSEMTR SSVDWSATEI MLDVENYQIV NLIGVIQGDV
DGSFS