Gene Cag_1699 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1699 
Symbol 
ID3746388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2206529 
End bp2208310 
Gene Length1782 bp 
Protein Length593 aa 
Translation table11 
GC content47% 
IMG OID637774236 
Producthypothetical protein 
Protein accessionYP_379993 
Protein GI78189655 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00360282 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAGCT TCTGCCAACA ACTTTTTTCA TGCTTTAGCT TAACTACCCG ACGCACCATG 
AAACCAACAT TTCATTTCAC CCTTATTTGT TGTGCTGTTG CACTTTTGGC AAGCAATCAA
CAAACCTTTG CAGGTAATGA CGTGCTGCAT CAAAATAGTG CCTCGCTCCA AAGTTCACGC
TTGCAAAATG AGCCACAACG TGCGCCAACG CCATCGGTGA TTGCCTTTCA ACCCAATCAA
AAAAGTGTTA CAGGAACCAT TGGCAACGAT CGCATTACGC TCTATGGCAA TAGTGGTTTG
GCAAGTACCG CAACACCCGT GCGCGATCCC GCAATTTCGC GCTCGCAAAC CTTTAGCTCT
ACCGTGGCGA TTGATTTAAA TGGTACCATT TTGCCCGGTA CAGCGGCTGA AACCCCTACG
GTACATTTTT ACATTAATGG CAAAGATTAT GGCTTAGCTA CGCTCAGTAG TGTACAAAGT
GATTACAGTA AAAAAATTGG CGGCATTGCC CACAGTGGCA AGCAACGCTT TATTTTTCCT
GTAGATGATA TTGATATTCG CACCATCAAA ATTGAAATTG AGTCGCCTGC CGTACTGCGT
TCCGAAGTCT ATATTTATGG CGTTACCATT ACGCCCGAAG GCGCGGCTGA CCAAAAAGTT
GAACCCAGCT CGTTGCGTGG TGCCACGGTA ACCTTTGCCA CGCCATCGCG CTATTATGAT
GGTGGAAAAA GCTATAAAAT CCCTTACGGC TCAATTCCAA GCGATGTTCG ATCTATTACT
ATTGATACCT CACTCTATCG CAAAACATTG CAGCAAGCGC CTGGCACTCC AGCAAATCCA
TTAACAGTGC ATGGCGGTGG CGGCATTGAT ACACTTTACT TGCTTGGTAA CCAAGATCAG
TACGTGTTGG CAGGTGGTAA AAATAGTTCG CTTATTATTG CTGAATCGGC GGGGTTAAGC
CAAAATGCGC TCGCCACTAA CATTGCAAAA GTTGAATTTG CTGATAGCTC TTTCTTTTTA
CCTCCGCAAG CTACCGGCAT CAATGAAGCG GTATTGGCAG AAAACGAGGC ATCAATAGCT
ACACTTCGCG CCACCTTGCC ACCCCATGTG GGCATGGCAG TTCATCCCAT AAAGCATGAC
CCGCTACGTG GCAAAATAGG CGATGTATTA AGCAAAAATT GCCCCACCGA GTTTTACCAA
AATGCTGTGC GTTTAACGGC ACCGCAAGGT GATGCTCCCA TGGCTCTACG CTCTTTGGGA
TTTGTTATTG CACCAGCACG CCGTGATGCC CCTACCATTA AGTTGCAAGG TGCTGCAAAA
GAGCAAGAAA TGGTGGTAGT TGATACAGCC ACTTTGCCAG AAACTTCATT GATTGAGGCA
CAGCGCGTTA ATGTTGTTTT GTTGAGCGGC AAAACGCCGA TAACGTTTCG CGGAAATGGT
GATGGCATGG TAATTCTTGC GGACGAAGGC AATCAAGAGA TGTGGGGCGG CACGGGCGAT
GACCTATTAT GGGGAGGTGT GGGAAATGAT AATTTGTATG GGGGAATTGA TGATGACCTG
CTTTGCGGCG GCAGTGGCGA TGATGTGTTA GATGGAGGTT CAGGCATTGA TGCCGCCTAC
TTTAGCGGCA AAAGCGAAGA GTACCGCATT ACGCACCATC CCACAACCAG CATGACCACC
GTTGTGGATT TGGTTGCTGA ACGCGATGGT ACCGATAACC TCTTTAATAT TGAGCAATTA
CGTTTTGCCG ATAGAACTAT GTTTTTAGGC GATAAAAAGT AG
 
Protein sequence
MDSFCQQLFS CFSLTTRRTM KPTFHFTLIC CAVALLASNQ QTFAGNDVLH QNSASLQSSR 
LQNEPQRAPT PSVIAFQPNQ KSVTGTIGND RITLYGNSGL ASTATPVRDP AISRSQTFSS
TVAIDLNGTI LPGTAAETPT VHFYINGKDY GLATLSSVQS DYSKKIGGIA HSGKQRFIFP
VDDIDIRTIK IEIESPAVLR SEVYIYGVTI TPEGAADQKV EPSSLRGATV TFATPSRYYD
GGKSYKIPYG SIPSDVRSIT IDTSLYRKTL QQAPGTPANP LTVHGGGGID TLYLLGNQDQ
YVLAGGKNSS LIIAESAGLS QNALATNIAK VEFADSSFFL PPQATGINEA VLAENEASIA
TLRATLPPHV GMAVHPIKHD PLRGKIGDVL SKNCPTEFYQ NAVRLTAPQG DAPMALRSLG
FVIAPARRDA PTIKLQGAAK EQEMVVVDTA TLPETSLIEA QRVNVVLLSG KTPITFRGNG
DGMVILADEG NQEMWGGTGD DLLWGGVGND NLYGGIDDDL LCGGSGDDVL DGGSGIDAAY
FSGKSEEYRI THHPTTSMTT VVDLVAERDG TDNLFNIEQL RFADRTMFLG DKK