Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_1541 |
Symbol | |
ID | 3746600 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 2020049 |
End bp | 2021623 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637774081 |
Product | phosphodiesterase |
Protein accession | YP_379839 |
Protein GI | 78189501 |
COG category | [R] General function prediction only |
COG ID | [COG1418] Predicted HD superfamily hydrolase |
TIGRFAM ID | [TIGR00277] uncharacterized domain HDIG [TIGR03319] conserved hypothetical protein YmdA/YtgF |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00592062 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGATTG TTATAAACCT CTTATTGCTT GTATTAGCGG CTCTTGTGGC GTTTGTTGCA GGCTTTTTTA TTGGGCGCTA CTTTCTTGAG CGCATTGGTA CTACAAAGGT TTTAGAGGCT GAAGAACGAG CGGTGCAAAT TGTGCAGGAA GCTCAAAAAG AGGCAAATGA GTACAAGGAA TTAAAGGTTA GCGAAGTTAA TCAGGAGTGG AAAAAAAAGC GTCGTGAGTT TGAGCAAGAT GTGCTTATTA AGAACAACAA ATTTGCACAG TTACAAAAGC AGTTGCAGCA ACGCGAAGCG CAACTGAAAA AGCAATCGCA AGATGTGCGC GATGCTGAGC GCAAATTGCA AGATCAGCGC AAAGAAGTAG AGCAGTTAAG TGATTCGGTG AAGCTTCGTG CTACTGAGCT TGAGCGCGTT ATTGTGGAGC AAAATCAACG TCTCGAAAGC ATTAGCAATC TGCAAGCTGA TGAGGCTCGC CAAATGCTTA TTGATAATAT GGTTACACAA GCACGCGAAG AAGCAAGCAA CACCATTCAC CGCATTCACG AAGAGGCTGA GCAGCAAGCC ACGCGCATGG CAGAAAAAAC CCTCATTACG GCTATCCAGC GCATCTCTTT TGAGCAAACC ACTGAAAATG CTCTTTCGGT AGTTCACATT CAAAGTGATG AATTAAAAGG GCGCATTATT GGTCGTGAAG GGCGCAACAT TAAAGCTTTT GAAAATGCTA CTGGGGTTGA CATTATTGTT GACGATACCC CCGAAGTGGT TATTCTCTCC TGCTTTGATC CCTTGCGCCG AGAGCTGGCA AAACTCACCC TTAAAAAATT GCTTGCCGAT GGCATTATTC ATCCCGTAGC TATTGAAAAA GCTTATGCGG ATGCTACCAA AGAGATTGAC GATGTTGTCT ATAGTGCGGG CGAAGAGGTG GCGGCATCGC TCCAACTTAA CGACATTCCC ACCGAAGTGA TTGCGCTTCT TGGCAAAATG AAGTTCCACA CCGTGTATGG GCAGAACTTG CTACAACATA GCCGTGAAGT AGCAATGCTT GCAGGCGTTA TGGCGGCAGA GTTAAAGCTT GATGCACGTA TGGCAAAACG GGCAGGTTTA TTGCACGATA TTGGCTTAGT GCTGCCCGAA AGCGATGAGC CACATGCAAT TACGGGCATG AATTTTATGA AGAAATTTAA TGAGTCAGAC CAACTGCTTA ACGCTATTGG CGCTCACCAT GGTGATATGG AAAAAGAGTC GCCACTTGCC GATTTAGTTG ATGCCGCCAA CACCATTTCG CTTTCACGTC CCGGTGCGCG TGGTGCCGTA ACGGCTGATG GCAACGTTAA ACGCCTTGAA AGCCTTGAAG AAATTGCAAA GGGCTTCCCT GGAGTGTTAA AGACCTATGC GTTACAAGCA GGGCGCGAAA TTCGTGTGAT TGTGGAAGGC GATAACGTCA GCGATTCGCA AGCCGATATG CTTGCCCACG ATATTGCTCG TAAAATTGAG TCGGAAGCGC AATATCCCGG TCAAATTAAA GTTTCCATTA TTCGCGAAAA GCGTTCAGTG GCTTACGCCA AGTAA
|
Protein sequence | MGIVINLLLL VLAALVAFVA GFFIGRYFLE RIGTTKVLEA EERAVQIVQE AQKEANEYKE LKVSEVNQEW KKKRREFEQD VLIKNNKFAQ LQKQLQQREA QLKKQSQDVR DAERKLQDQR KEVEQLSDSV KLRATELERV IVEQNQRLES ISNLQADEAR QMLIDNMVTQ AREEASNTIH RIHEEAEQQA TRMAEKTLIT AIQRISFEQT TENALSVVHI QSDELKGRII GREGRNIKAF ENATGVDIIV DDTPEVVILS CFDPLRRELA KLTLKKLLAD GIIHPVAIEK AYADATKEID DVVYSAGEEV AASLQLNDIP TEVIALLGKM KFHTVYGQNL LQHSREVAML AGVMAAELKL DARMAKRAGL LHDIGLVLPE SDEPHAITGM NFMKKFNESD QLLNAIGAHH GDMEKESPLA DLVDAANTIS LSRPGARGAV TADGNVKRLE SLEEIAKGFP GVLKTYALQA GREIRVIVEG DNVSDSQADM LAHDIARKIE SEAQYPGQIK VSIIREKRSV AYAK
|
| |