Gene Information |
Locus tag | Cag_0290 |
Symbol | |
ID | 3748119 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 316631 |
End bp | 319576 |
Gene Length | 2946 bp |
Protein Length | 981 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637772816 |
Product | hypothetical protein |
Protein accession | YP_378609 |
Protein GI | 78188271 |
COG category | [R] General function prediction only |
COG ID | [COG1752] Predicted esterase of the alpha-beta hydrolase superfamily |
TIGRFAM ID | |
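The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it is consistent with the reported 2946 bp), and that the protein length excludes the terminal stop codon. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percent G+C over the A/C/G/T characters; spaces and other symbols are ignored."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# Values taken from the record for Cag_0290.
length = gene_length_bp(316631, 319576)
print(length)                     # gene length in bp
print(protein_length_aa(length))  # protein length in aa
```

With the record's coordinates this gives 2946 bp and 981 aa, matching the Gene Length and Protein Length fields. `gc_percent` can be applied to the gene sequence string as printed in the Sequence section, since it skips the spaces between the 10-base groups.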
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATCGC CATTCCCCCT TTTTTTCCCT CTTTTACAAA AAAAGAGAGA TTGCTCGAAG CAGGCTCAGT GCTCGAAACG GCGTTTGATT ATGGCGTTGC TGGCAGTGAG CGCACTTGCA CCAGCGAACC TTTATGCGGC ATCAAAAGCA AAACGTCAGC CGCCATTGGT TGCGCATGTT TACCCCGACA CGTTGGCGTT GCCCTATAAT CGTTACGCAC TAAAGCCCTT TATGCGTCCT GCGCGCAAAA GTGTTGCGGT GGCGCTGTCA GGGGGAGGGG CAAATGCGTT GGCTCAAATT GGGGTGCTAA AGGCGTTTGA GGAGGCGCAC ATTCCTGTGG ATGCCATTGC GGGCACCAGC ATGGGGGCAA TTATTGGCGG CTTGTACAGT TGTGGCTATA GTGCGGCTGA ATTGGAGCAA TTGGCGCTTA CCATGCCATG GAGTTCCATT TTAGCCTTAC AGGAAGATTA TAGCCGTTCG TCGCTTTTTG TGGAGCAGCA GCGCATTCGT GATAGAGCAA CCATTGCGTT GCGTTTTGAT GGCTTAAAGT TGCTGTTGCC GCAGTCGCTT AATTCGGCGC AAGCGTTTAC ACGCACTATG GATATGTTGG TGCTTCATGC GTTGTACCAT CCGCACAGCA ATTTTTCAAG TCTGCCGATA GCTTTTCGGG CGGTAACTAC CGATTTAGTA AGTGGTGAAC GGGTAACGCT TGAGAGTGGC TCGCTTTCGG AAGCTATGCG GGCAAGTAGT ACCGTGCCTA TTTTGTTTGA GCCGATTCAT CGTGCAGAAC AGCAGTTAGT GGATGGGGGA TTGGTTGCAA ATTTGCCTGT TGATGAGTTA GCGCATTTTG GGGCTGATTG CAAAATTGCC ATTGATACAC ATGGCAGCAT GTACGCCACC GGTAAGGAGC TTGATCTCCC TTGGAAAGCG GCTGACCAAG CTATGACCAT TCTTATCACG TTGCAATATC CTGCCCAGCG AGCACAAGCT TCGTTGGTTA TTGAGCCTGA AACAGGCAAG CATAAGGCAA CCGATTTTAA AAACATTCCT CAATTGATAG CGGCTGGTTA TGTGGCAGGC AAGCAGCAAG TGCCAACGTT ACAGCGCTTG CTTGCCATTA CTTCTCCCTC AAACTCGTCA GCCCCACAAA CCTCATCCGT TCCACCATCG TCAATAGTTC CATCCGTTGC AACTCCCCCC CCCATTTCCC CTATCCTTAC TGCCAACAAG AAGGAGATGC GGAATTTTTC GCTTGCCACT TACACCAAGC GCTGGAGTAT ATCCCCCACA TCAACGGAGT TAGAGCGGTT GGTTGGCGAA AAGGTGGCAT CGGCTTTGGA GCTTCATGCG CTTTTACGCG ATTTGCTTGC CACCGATTAC TTTGCTCGTG TGTCGGCTGA AGTTCATCAA GAAGATCGAA CAGTAACGGT AAAGCTTGAA GCCTTGCCAT CAGTTACGGT AGTTACGGTG CAGGGTGAAT TAGCGGATGA GCTGAGTAGT GCGGAACTTA ACGAATGCTT TGCGCCACTC ATGGGCAGGC TTTATACAAA CCATCAAGCT ACGGCGGCAC TTGAGGCACT TGTGCGGCGT TTGCGGGCTA AAGGGTATAG TTTAGCTGCC ATTGAGCAGG TTCATGTAGA GAACGAACGC TTAACCATCA CCTTTTCATC GGGTAAGGCG GCAATGCTTA CCATTTCGCT CAATAAAGGG CGAACGCTTT TAACGCCTAT TCAACGTGAG TTAAAATTAG ATGCAACAAA ACCTTTACGC TTGCGAGCGG CTGAAGAGTC GGTTAAAAAC 
CTTTATGAAA CGGGTGTTTT TAACCGTGTT TCACTTTTTG CCGAACCTAT CACGCAGACT GAAGCAATAG CTCCAATATC CTCAACAACC CCAAACCAAA CGATTCATCT TTCGCTTGAA GAAAAACCAG CCTCCGTGTT GCGTCTTGGC TTGCGTTACG ATGAAACCAA CAACGCGCAA TTGTTGCTTG ATGTGCGCAA TGAAAATGTT GGGGGAACTA CGAACACTAT GGGGGGATGG GTTAAGGCTG GTAGAAAAGG GTATCTCGCC AATATGGAAC TTAATATGCC ACGCATTGGT GCAACCCATT TAATTTTTGC CACGCGTTTA TTTTTTGACT CTTACCTCTT TGATTACACC AACTCTGATG GCTCGCTGGC ACCTTACAAC ATTCAAAAAT ATGGTATAAC CTCCTCGTTT GGCACACGCT TGCGTAAAAA TGGGCACTTC TTAACCGATG TTTCCTATTA CAACAGCCAA GCCTTTACCG ATGAAGCGCA TCGTCCACTA TTTTCGACCA CCAACAATAA CGTGCTCACC ATTGGCACGC ATTTGACCAT TGATTCACGC AATAACGCGC TTATGCCTAC ACGCGGTAGC TATAGCTATC TCACCTACGC ATTTACACCG TTAAGCCTTG ATGATGGATT GCGCTATTGG CAGTTTTCGG GCACCCATCA AGTCAATCTG CCACTTGGGC GCGAAACCAC CTTGCAACTT TCAGCAATGA CGGGGGTTAG TAGTAAAGCA CTTCCCTTAT CGGAACAATA TTTTTTAGGG GGAATAGGTA ACAGCTATAG TGCGCGATTT ATTGGGTTAC AGCCACATGC GCTTGCCACA AACAACGTTG CAACCGCAGG CGTGCAGCTC TCTTATGAGC CCTCCTTCCC CATTCTTTTC CCCACCACAC TGCAACTGCA TTACAATGCG GGGAGGGGAT GGAACGCAAT GGAAAACGTC CGCTTGGATG GGGCATTACA AGGGGCATTG CAAGCGGTTG GGGCAAGCAT GGTATGGAAA ACACCGCTTG GTCCCACGCG CTTTACGTTG GCAAAAGTGT TAGTCAATAA CGATGATAAC AGCCTCATGC TTCCACACCG TGATGACGAC CCTGTTTTTT ACTTCAGCAT TGGGCACGAT TTTTAG
|
Protein sequence | MKSPFPLFFP LLQKKRDCSK QAQCSKRRLI MALLAVSALA PANLYAASKA KRQPPLVAHV YPDTLALPYN RYALKPFMRP ARKSVAVALS GGGANALAQI GVLKAFEEAH IPVDAIAGTS MGAIIGGLYS CGYSAAELEQ LALTMPWSSI LALQEDYSRS SLFVEQQRIR DRATIALRFD GLKLLLPQSL NSAQAFTRTM DMLVLHALYH PHSNFSSLPI AFRAVTTDLV SGERVTLESG SLSEAMRASS TVPILFEPIH RAEQQLVDGG LVANLPVDEL AHFGADCKIA IDTHGSMYAT GKELDLPWKA ADQAMTILIT LQYPAQRAQA SLVIEPETGK HKATDFKNIP QLIAAGYVAG KQQVPTLQRL LAITSPSNSS APQTSSVPPS SIVPSVATPP PISPILTANK KEMRNFSLAT YTKRWSISPT STELERLVGE KVASALELHA LLRDLLATDY FARVSAEVHQ EDRTVTVKLE ALPSVTVVTV QGELADELSS AELNECFAPL MGRLYTNHQA TAALEALVRR LRAKGYSLAA IEQVHVENER LTITFSSGKA AMLTISLNKG RTLLTPIQRE LKLDATKPLR LRAAEESVKN LYETGVFNRV SLFAEPITQT EAIAPISSTT PNQTIHLSLE EKPASVLRLG LRYDETNNAQ LLLDVRNENV GGTTNTMGGW VKAGRKGYLA NMELNMPRIG ATHLIFATRL FFDSYLFDYT NSDGSLAPYN IQKYGITSSF GTRLRKNGHF LTDVSYYNSQ AFTDEAHRPL FSTTNNNVLT IGTHLTIDSR NNALMPTRGS YSYLTYAFTP LSLDDGLRYW QFSGTHQVNL PLGRETTLQL SAMTGVSSKA LPLSEQYFLG GIGNSYSARF IGLQPHALAT NNVATAGVQL SYEPSFPILF PTTLQLHYNA GRGWNAMENV RLDGALQGAL QAVGASMVWK TPLGPTRFTL AKVLVNNDDN SLMLPHRDDD PVFYFSIGHD F
|
| |