Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_1722 |
Symbol | |
ID | 6375409 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 1865774 |
End bp | 1867072 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642684215 |
Product | homoaconitate hydratase family protein |
Protein accession | YP_001960121 |
Protein GI | 189500651 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0065] 3-isopropylmalate dehydratase large subunit |
TIGRFAM ID | [TIGR01343] homoaconitate hydratase family protein [TIGR02086] 3-isopropylmalate dehydratase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.410724 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACAAA CAATAACGCA GAAAATTCTC GCCAGAGCCG CCAACCGCAA ATTTGTCGAG GCGGGAGAAA ATATCTGGCT CAACGTTGAC GTCCTGCTCA CCCATGATGT GTGTGGCCCG CCGACCTTCG ACATTTTCAG GGAAAAATTC GGTCCGGAAG CAAAAGTGTG GGACCCGGAA AAGATTGTGG TTCTTCCTGA TCACTACATA TTCACTGAAA ACGAGCACGC ACACCGCAAC ATCGATCTTC TTCGTGAGTT CGCCAACGAG CAGAAACTGC CAAACTACTA CGATGTAGGA ACAAAACGCT ACAAAGGGGT CTGTCATGTC GCTCTTGCCC AGGAAGGCTA CAACCTCCCC GGCACGGTTC TGTTCGGAAC CGATTCGCAT ACCTGTACCT CGGGCGCGTT CGGCATGTTC GGAACAGGAA TAGGCAACAC AGATGCCGCC TTTATCCTCG GCACAGGCAA GATCTGGGAA AAAGTACCGG AATCAATGAA GTTCATCTTT GAAGGCGAGA TGCCGGAATA CCTGATGGCC AAAGATCTGA TCCTGCAGAT ACTGGGAGAT ATCTCTACCG ACGGAGCGAC CTATCGTGCC ATGGAGTTTG ACGGTTCAGC AGTGTTCTCG TTGCCGATGG AAGAGCGAAT GACGCTCACC AATATGGCGA TCGAAGCCGG TGGAATGAGT GGAATCATCG CAGCCGATTC GATCACGGAA GCCTATGTCA AAGAACGCAG CGACAAGCCC TACGAAATAT TTACCAGCGA CCCCGACGCC CTGTATCACA GCATCCACCG CTACAAAACA GAAGAGCTCG AACCGGTTGT CGCCATGCCG CACAGCCCGG ACAACCGCGC GACGGTTCGA AGCGTTCAGG GCACGAAAAT AACCAAATCA TATATCGGCT CATGCACCGG AGGTAAACTC ATCGACTTCG TTATGGCGGC AAAAGTCCTG AAAGGAAACA CGGTTTCCGT ACCGACTAAT ATTGTACCCG CAACCGTTGA AGTCGCCCAA AGCCTTACAA CAGAAGAGTT CGACGGTCAG CCGATCATAA CAATCCTTAA AGAGGCCGGC TGCACAATCG CTCCTCCTTC ATGCGCGGCA TGCCTTGGCG GTCCATCCGA TACCTTCGGA CGTTCCGTAG ATAACGATCT TGTTGTCTCT ACAACAAACA GGAACTTTCC CGGACGTATG GGAAGCAAAA AAGCAGGCGT CTGCCTCGCT TCACCGCTCA CAGCTGCCGC TTCGGCAATT ACCGGAAAAC TTACTGATCC GAGAGAATTT CTCAGATAA
|
Protein sequence | MTQTITQKIL ARAANRKFVE AGENIWLNVD VLLTHDVCGP PTFDIFREKF GPEAKVWDPE KIVVLPDHYI FTENEHAHRN IDLLREFANE QKLPNYYDVG TKRYKGVCHV ALAQEGYNLP GTVLFGTDSH TCTSGAFGMF GTGIGNTDAA FILGTGKIWE KVPESMKFIF EGEMPEYLMA KDLILQILGD ISTDGATYRA MEFDGSAVFS LPMEERMTLT NMAIEAGGMS GIIAADSITE AYVKERSDKP YEIFTSDPDA LYHSIHRYKT EELEPVVAMP HSPDNRATVR SVQGTKITKS YIGSCTGGKL IDFVMAAKVL KGNTVSVPTN IVPATVEVAQ SLTTEEFDGQ PIITILKEAG CTIAPPSCAA CLGGPSDTFG RSVDNDLVVS TTNRNFPGRM GSKKAGVCLA SPLTAAASAI TGKLTDPREF LR
|
| |