Gene Cag_0290 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0290 
Symbol 
ID3748119 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp316631 
End bp319576 
Gene Length2946 bp 
Protein Length981 aa 
Translation table11 
GC content49% 
IMG OID637772816 
Producthypothetical protein 
Protein accessionYP_378609 
Protein GI78188271 
COG category[R] General function prediction only 
COG ID[COG1752] Predicted esterase of the alpha-beta hydrolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATCGC CATTCCCCCT TTTTTTCCCT CTTTTACAAA AAAAGAGAGA TTGCTCGAAG 
CAGGCTCAGT GCTCGAAACG GCGTTTGATT ATGGCGTTGC TGGCAGTGAG CGCACTTGCA
CCAGCGAACC TTTATGCGGC ATCAAAAGCA AAACGTCAGC CGCCATTGGT TGCGCATGTT
TACCCCGACA CGTTGGCGTT GCCCTATAAT CGTTACGCAC TAAAGCCCTT TATGCGTCCT
GCGCGCAAAA GTGTTGCGGT GGCGCTGTCA GGGGGAGGGG CAAATGCGTT GGCTCAAATT
GGGGTGCTAA AGGCGTTTGA GGAGGCGCAC ATTCCTGTGG ATGCCATTGC GGGCACCAGC
ATGGGGGCAA TTATTGGCGG CTTGTACAGT TGTGGCTATA GTGCGGCTGA ATTGGAGCAA
TTGGCGCTTA CCATGCCATG GAGTTCCATT TTAGCCTTAC AGGAAGATTA TAGCCGTTCG
TCGCTTTTTG TGGAGCAGCA GCGCATTCGT GATAGAGCAA CCATTGCGTT GCGTTTTGAT
GGCTTAAAGT TGCTGTTGCC GCAGTCGCTT AATTCGGCGC AAGCGTTTAC ACGCACTATG
GATATGTTGG TGCTTCATGC GTTGTACCAT CCGCACAGCA ATTTTTCAAG TCTGCCGATA
GCTTTTCGGG CGGTAACTAC CGATTTAGTA AGTGGTGAAC GGGTAACGCT TGAGAGTGGC
TCGCTTTCGG AAGCTATGCG GGCAAGTAGT ACCGTGCCTA TTTTGTTTGA GCCGATTCAT
CGTGCAGAAC AGCAGTTAGT GGATGGGGGA TTGGTTGCAA ATTTGCCTGT TGATGAGTTA
GCGCATTTTG GGGCTGATTG CAAAATTGCC ATTGATACAC ATGGCAGCAT GTACGCCACC
GGTAAGGAGC TTGATCTCCC TTGGAAAGCG GCTGACCAAG CTATGACCAT TCTTATCACG
TTGCAATATC CTGCCCAGCG AGCACAAGCT TCGTTGGTTA TTGAGCCTGA AACAGGCAAG
CATAAGGCAA CCGATTTTAA AAACATTCCT CAATTGATAG CGGCTGGTTA TGTGGCAGGC
AAGCAGCAAG TGCCAACGTT ACAGCGCTTG CTTGCCATTA CTTCTCCCTC AAACTCGTCA
GCCCCACAAA CCTCATCCGT TCCACCATCG TCAATAGTTC CATCCGTTGC AACTCCCCCC
CCCATTTCCC CTATCCTTAC TGCCAACAAG AAGGAGATGC GGAATTTTTC GCTTGCCACT
TACACCAAGC GCTGGAGTAT ATCCCCCACA TCAACGGAGT TAGAGCGGTT GGTTGGCGAA
AAGGTGGCAT CGGCTTTGGA GCTTCATGCG CTTTTACGCG ATTTGCTTGC CACCGATTAC
TTTGCTCGTG TGTCGGCTGA AGTTCATCAA GAAGATCGAA CAGTAACGGT AAAGCTTGAA
GCCTTGCCAT CAGTTACGGT AGTTACGGTG CAGGGTGAAT TAGCGGATGA GCTGAGTAGT
GCGGAACTTA ACGAATGCTT TGCGCCACTC ATGGGCAGGC TTTATACAAA CCATCAAGCT
ACGGCGGCAC TTGAGGCACT TGTGCGGCGT TTGCGGGCTA AAGGGTATAG TTTAGCTGCC
ATTGAGCAGG TTCATGTAGA GAACGAACGC TTAACCATCA CCTTTTCATC GGGTAAGGCG
GCAATGCTTA CCATTTCGCT CAATAAAGGG CGAACGCTTT TAACGCCTAT TCAACGTGAG
TTAAAATTAG ATGCAACAAA ACCTTTACGC TTGCGAGCGG CTGAAGAGTC GGTTAAAAAC
CTTTATGAAA CGGGTGTTTT TAACCGTGTT TCACTTTTTG CCGAACCTAT CACGCAGACT
GAAGCAATAG CTCCAATATC CTCAACAACC CCAAACCAAA CGATTCATCT TTCGCTTGAA
GAAAAACCAG CCTCCGTGTT GCGTCTTGGC TTGCGTTACG ATGAAACCAA CAACGCGCAA
TTGTTGCTTG ATGTGCGCAA TGAAAATGTT GGGGGAACTA CGAACACTAT GGGGGGATGG
GTTAAGGCTG GTAGAAAAGG GTATCTCGCC AATATGGAAC TTAATATGCC ACGCATTGGT
GCAACCCATT TAATTTTTGC CACGCGTTTA TTTTTTGACT CTTACCTCTT TGATTACACC
AACTCTGATG GCTCGCTGGC ACCTTACAAC ATTCAAAAAT ATGGTATAAC CTCCTCGTTT
GGCACACGCT TGCGTAAAAA TGGGCACTTC TTAACCGATG TTTCCTATTA CAACAGCCAA
GCCTTTACCG ATGAAGCGCA TCGTCCACTA TTTTCGACCA CCAACAATAA CGTGCTCACC
ATTGGCACGC ATTTGACCAT TGATTCACGC AATAACGCGC TTATGCCTAC ACGCGGTAGC
TATAGCTATC TCACCTACGC ATTTACACCG TTAAGCCTTG ATGATGGATT GCGCTATTGG
CAGTTTTCGG GCACCCATCA AGTCAATCTG CCACTTGGGC GCGAAACCAC CTTGCAACTT
TCAGCAATGA CGGGGGTTAG TAGTAAAGCA CTTCCCTTAT CGGAACAATA TTTTTTAGGG
GGAATAGGTA ACAGCTATAG TGCGCGATTT ATTGGGTTAC AGCCACATGC GCTTGCCACA
AACAACGTTG CAACCGCAGG CGTGCAGCTC TCTTATGAGC CCTCCTTCCC CATTCTTTTC
CCCACCACAC TGCAACTGCA TTACAATGCG GGGAGGGGAT GGAACGCAAT GGAAAACGTC
CGCTTGGATG GGGCATTACA AGGGGCATTG CAAGCGGTTG GGGCAAGCAT GGTATGGAAA
ACACCGCTTG GTCCCACGCG CTTTACGTTG GCAAAAGTGT TAGTCAATAA CGATGATAAC
AGCCTCATGC TTCCACACCG TGATGACGAC CCTGTTTTTT ACTTCAGCAT TGGGCACGAT
TTTTAG
 
Protein sequence
MKSPFPLFFP LLQKKRDCSK QAQCSKRRLI MALLAVSALA PANLYAASKA KRQPPLVAHV 
YPDTLALPYN RYALKPFMRP ARKSVAVALS GGGANALAQI GVLKAFEEAH IPVDAIAGTS
MGAIIGGLYS CGYSAAELEQ LALTMPWSSI LALQEDYSRS SLFVEQQRIR DRATIALRFD
GLKLLLPQSL NSAQAFTRTM DMLVLHALYH PHSNFSSLPI AFRAVTTDLV SGERVTLESG
SLSEAMRASS TVPILFEPIH RAEQQLVDGG LVANLPVDEL AHFGADCKIA IDTHGSMYAT
GKELDLPWKA ADQAMTILIT LQYPAQRAQA SLVIEPETGK HKATDFKNIP QLIAAGYVAG
KQQVPTLQRL LAITSPSNSS APQTSSVPPS SIVPSVATPP PISPILTANK KEMRNFSLAT
YTKRWSISPT STELERLVGE KVASALELHA LLRDLLATDY FARVSAEVHQ EDRTVTVKLE
ALPSVTVVTV QGELADELSS AELNECFAPL MGRLYTNHQA TAALEALVRR LRAKGYSLAA
IEQVHVENER LTITFSSGKA AMLTISLNKG RTLLTPIQRE LKLDATKPLR LRAAEESVKN
LYETGVFNRV SLFAEPITQT EAIAPISSTT PNQTIHLSLE EKPASVLRLG LRYDETNNAQ
LLLDVRNENV GGTTNTMGGW VKAGRKGYLA NMELNMPRIG ATHLIFATRL FFDSYLFDYT
NSDGSLAPYN IQKYGITSSF GTRLRKNGHF LTDVSYYNSQ AFTDEAHRPL FSTTNNNVLT
IGTHLTIDSR NNALMPTRGS YSYLTYAFTP LSLDDGLRYW QFSGTHQVNL PLGRETTLQL
SAMTGVSSKA LPLSEQYFLG GIGNSYSARF IGLQPHALAT NNVATAGVQL SYEPSFPILF
PTTLQLHYNA GRGWNAMENV RLDGALQGAL QAVGASMVWK TPLGPTRFTL AKVLVNNDDN
SLMLPHRDDD PVFYFSIGHD F