Gene Cag_1045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1045 
Symbol 
ID3747773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1411475 
End bp1414048 
Gene Length2574 bp 
Protein Length857 aa 
Translation table11 
GC content46% 
IMG OID637773574 
Producthypothetical protein 
Protein accessionYP_379350 
Protein GI78189012 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0251387 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGGAAAA AAGCGCTCAG CATTGCACTT GCAAGCTTGT TGCTATTGGG AACATCCCTA 
CCAATAGCAG CATGGCTTGC ATTACCACGT TATGTGGAAC CGTTGCTGCA ACGAGCACTT
ATTGGGAAGC CTGTACAAAT TGCCATTAAG GATGTTCGCC CCAGCTTGCA TGGTGTGGCT
TTTTCAAGCT TGCAAGCCAC AATCACCACC CCGCCTGACG AGTGCAATAA TTACGAGCGC
ACCATTTACC ATGTTACTAT AAAGAATGGA ACAATTGGGT GGCTAATTAC CGACCTTAGT
GCAAGCCATC GCTCCCCTTT TATCCCCTCC TTGCTTGATG TAAAACTGCA CCTGCAAGCC
GATACATTGC ACCTGCAACC AACGCCCAAC ACCTTTGCCT TTAGCGATAG CCAGCCAGAA
ATAACGGTAA ACCTCAAGCT CTTTCGTAAT GAAAAGCAAG TTCTATCGGT AGTGCCGCTT
GATGCCGCTT ATGCCATTCA TGATGGCACA GTTACACGAG AGCAAATGCG TTTTGAAGGC
ATTGCATACA ACGTAGCAGT CAGTAGTTCC AACAAATGGC AGCAACTCCC CGATTCACTC
TTTGTGGCAC GAATGGTCAA CGAAGGCAAA GTGCAACCTG TAGGCAACTT TCGTGCAATT
GTTGGCTCAA AAGGCGATCC GCTGCATCCC TGCCGCATCA CGCTTAGCAA CTGTTCGGCT
GAAATTGTGG ATTGGAACGC CTCTTCACCG TTTGTGCATT TTGATAGAAA AACCAAAGCG
GGCGATTTAA CGCTCTGCAT CAACGATTTT CCTTTGCAGT CGCTTTCAAG TATAGCGTTA
CAAGCGGCAC AACAACAACC AAAAGCTCCA TCTCGTTTAG CAGCAAAAGC CCCACTACCA
CCAATGGTTG CTGGAAAAAT AAATGCCACC ATACCGCTCT CGTTTCGTGA TTCTACGATC
GTGATTCGTA ATGCGTCGGT GATTGCTAAA GCGGGGGCAA AAGTGGTGCT CTACAACAAG
CAACAGCAGC CGATGCTGTT TGTTGTAGCA AACAAAAGCG GTATGGATGA GCGCATAGTG
GATAAGCTGT ATGTTACCGC AACGCTTAAC CATGCAGGCA AAACCACCCA ATCGGTTGCT
TTACAAAACC TTTCAGCCAC CATTTTTGAT GGCTCCATTC GTTCCACACC CTTAACCGTA
AAAACGGACG GTTCTTCGCC GCTTGATGTA ACGGTTACCT TTGATAACCT GAAGCTTTTT
GATCACCTTA TTTTGCCTGA TAACGAGCAA AGCTCTTTTC AGGGGGCATT AAGTGGTAAA
TTGCCAATAC GTTATGCAAA GAATCAACTT ACTATTCGCA ACGCGTCGCT ACTTGCAAGC
GAAGGCACGC AAGTAAAGCT TGTTACTAAA GAGCAAAAAC CCTTAGTAAC TATTATTGCT
GGTAAAAAAG GGGGAAAAGA AACTGTTCTT GATAAGCTGA ATGTTAGGGC ACGGTTTAAC
CAGACGCCCA ACCAAACGGC TTCCATTACA TTACAAGAGT TTTCAACAAC TCTTTTTGGC
GGCTCAGTTA ATGTTACTCC ATTAACATTT AAAACCGACG CTTCTTCGCC GCTTGTTGCA
ACCGTTACGT TGGATAAGGT AAAGCTGTTT GAGCACCTTA TTTTGCCTGC CAATCTGCAT
GGCTCGCTTT ATGGCGACCT GAGCGGTAAG GTGCCTCTCA CCTATCAAAA CGATCAACTT
TCCATAAGCA ACGCTACCCT GCGTTCATCG GGAGGTGGCT CTTTTACCCT CAACAATGCT
CAGCAATCAT CCAATAACAA CCTCAGCCGT TCCGATCAGC AAACCACCTA CGCTTTTAGC
GAACCAGCAC TAACCTTTTC GCACCTTGCA AATGGAGCCA CAACGGTTGA TTTTACGCTA
AACGAGTTTC GCCAAAAAAG TGGCAGTAAC GATTTTAAGT TTGGCAATCC AAAAGGCACC
ATCCATTTTG CCGAAAATCC TCGCGAACCC GATGTAATGC GTTTAAGCAA CTTTTCAACC
AACTTTTTTG GTGGCAAAAT TGCGCTTAAT GAATTTGTGT ATGATATAAA AAAGCAAGAA
GGAGAAACCA TTGTGCAACT AAGCAATATG CCATTGCAAA AATTGCTTGA CTTGCAAGGC
ACCAAAAAGG TTTATGCAAC AGGCGCTTTA AAGGGAAATA TTCCTATAAA GCTTAAAAAA
GGCACCGTTG AAATTCCCGA TGGAGCACTT TTAGCGCAAG AATCGGGGCA GATTATTTAT
GCAACTTCGC CCGAAGAGCG AGCCGCCGCA CACCAAAGCT TGCGCACTAC CTACGAGGTA
CTTTCCAACT TTTTGTACCA GCAACTCTCC ACCTCGCTTA CCATGACGCC CGATGGGCAA
TCAACGTTTG CAATTCGCTT AAAGGGAACC AATCCCGATA TGTATGGCGC ACGCCCTGTT
GAACTGAACT TGAATGTGCA GCAAAACCTG CTTGACCTTA TGCGCACGCT CTCCATTTCT
TCGGAAATTG AGCAAGCGAT ATCCGATAAA ACCACACAGC AGCAAAAGAA ATAG
 
Protein sequence
MRKKALSIAL ASLLLLGTSL PIAAWLALPR YVEPLLQRAL IGKPVQIAIK DVRPSLHGVA 
FSSLQATITT PPDECNNYER TIYHVTIKNG TIGWLITDLS ASHRSPFIPS LLDVKLHLQA
DTLHLQPTPN TFAFSDSQPE ITVNLKLFRN EKQVLSVVPL DAAYAIHDGT VTREQMRFEG
IAYNVAVSSS NKWQQLPDSL FVARMVNEGK VQPVGNFRAI VGSKGDPLHP CRITLSNCSA
EIVDWNASSP FVHFDRKTKA GDLTLCINDF PLQSLSSIAL QAAQQQPKAP SRLAAKAPLP
PMVAGKINAT IPLSFRDSTI VIRNASVIAK AGAKVVLYNK QQQPMLFVVA NKSGMDERIV
DKLYVTATLN HAGKTTQSVA LQNLSATIFD GSIRSTPLTV KTDGSSPLDV TVTFDNLKLF
DHLILPDNEQ SSFQGALSGK LPIRYAKNQL TIRNASLLAS EGTQVKLVTK EQKPLVTIIA
GKKGGKETVL DKLNVRARFN QTPNQTASIT LQEFSTTLFG GSVNVTPLTF KTDASSPLVA
TVTLDKVKLF EHLILPANLH GSLYGDLSGK VPLTYQNDQL SISNATLRSS GGGSFTLNNA
QQSSNNNLSR SDQQTTYAFS EPALTFSHLA NGATTVDFTL NEFRQKSGSN DFKFGNPKGT
IHFAENPREP DVMRLSNFST NFFGGKIALN EFVYDIKKQE GETIVQLSNM PLQKLLDLQG
TKKVYATGAL KGNIPIKLKK GTVEIPDGAL LAQESGQIIY ATSPEERAAA HQSLRTTYEV
LSNFLYQQLS TSLTMTPDGQ STFAIRLKGT NPDMYGARPV ELNLNVQQNL LDLMRTLSIS
SEIEQAISDK TTQQQKK