Gene Cpha266_0848 details


Gene Information

Locus tag: Cpha266_0848
Symbol:
ID: 4570442
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 970211
End bp: 971506
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 54%
IMG OID: 639765446
Product: homoaconitate hydratase family protein
Protein accession: YP_911323
Protein GI: 119356679
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0065] 3-isopropylmalate dehydratase large subunit
TIGRFAM ID: [TIGR01343] homoaconitate hydratase family protein
            [TIGR02086] 3-isopropylmalate dehydratase, large subunit
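
The coordinate and length fields in this record are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the database record: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names gene_length_bp and protein_length_aa are hypothetical helpers introduced here.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(970211, 971506)
print(length)                     # 1296
print(protein_length_aa(length))  # 431
```

Both values match the Gene Length and Protein Length fields listed above.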


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.288921
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCACAAA CAATAACCCA GAAAATCCTT TCAAGGGCGG CAAACCGCAA GTTTGTAGAT 
GCCGGGGAAA ACGTCTGGCT GAACGTCGAT ATCCTGCTCA CACACGATGT GTGCGGACCG
CCTACCTTCG ACATCTTCAA ACAGGAATTC GGACCCGACG CGAAGGTCTG GGATCCTGAA
AAAGTGGTTG TGCTACCCGA CCACTACATC TTTACCGCTA ACGAGCATGC GCACAGAAAT
ATTGACCTCT TGCGGCAGTT TGCCCGTGAA CAGAGTCTCC CCAACTACTA CGATGTTGGC
ACAAAACGAT ATAAAGGGGT CTGCCATGTC GCACTGGCTG AAGAGGGATT CAACATTCCG
GGAACCGTGC TCTTTGGAAC CGATTCGCAC ACCTGCACGT CGGGAGCCTT CGGCATGTTC
GGCTCGGGCA TCGGCAATAC TGACGCGGCA TTCATTCTCG GCACCGGCAA ACTCTGGGAA
AAAGTCCCCG AATCCATGAA GTTCACCTTC GATGGCCAGA TGCCTGAATA CCTCACCGCC
AAAGACCTTA TCTTGCATAT TCTCGGCGAT ATCGGCACCG ATGGCGCGAC CTACCGGGCC
ATGGAATTTG ACGGCGAAGC GGTCTTCTCT CTTCCGATTG AAGAGCGAAT GACGCTTACC
AACATGGCAA TCGAGGCTGG CGGCATGAAC GGCATCATCG CGGCAGATGC CATTACCGAG
GCATACGTCA ACGCAAGAAC GCAGAAACCA TATGAGCTCT TCAGGAGCGA TCCTGATGCC
CGTTACCACA GCAGCTACAC CTACAATGTC CGGGAACTTG AGCCCGTGGT CGCATGTCCG
CACAGCCCGG ATAACCGTGC AACCGTCAGA AGCGTTCAGG GAACGGCCAT CACCAGATCC
TATATCGGCT CATGTACCGG CGGCAAACTC AGTGATTTCA TGATGGCGGC TAAAATCCTC
AACGGCAAAA AAGTATCGGT TCCCACCAAT ATCGTTCCGG CAACCGTCAA GGTAGCCGCT
GATCTTGCAA TCGAACAGTA TGAAGGGAGA ACCCTGAAGC AGATTTTCGA GGATGCCGGA
TGCAGCATCG CCCTGCCCTC CTGTGCGGCC TGTCTCGGCG GTCCGAGCGA TACCGTCGGA
CGTTCGGTTG ATAACGACGT CGTTGTGTCA ACGACAAACA GGAATTTCCC TGGACGCATG
GGCAGTAAAA AAGCCGGTGT CTACCTTGCC TCACCGTTGA CGGCAGCAGC GTCAGCCATT
ACCGGCAAGC TCACCGACCC GAGAGATTTC CTCTGA
 
Protein sequence
MAQTITQKIL SRAANRKFVD AGENVWLNVD ILLTHDVCGP PTFDIFKQEF GPDAKVWDPE 
KVVVLPDHYI FTANEHAHRN IDLLRQFARE QSLPNYYDVG TKRYKGVCHV ALAEEGFNIP
GTVLFGTDSH TCTSGAFGMF GSGIGNTDAA FILGTGKLWE KVPESMKFTF DGQMPEYLTA
KDLILHILGD IGTDGATYRA MEFDGEAVFS LPIEERMTLT NMAIEAGGMN GIIAADAITE
AYVNARTQKP YELFRSDPDA RYHSSYTYNV RELEPVVACP HSPDNRATVR SVQGTAITRS
YIGSCTGGKL SDFMMAAKIL NGKKVSVPTN IVPATVKVAA DLAIEQYEGR TLKQIFEDAG
CSIALPSCAA CLGGPSDTVG RSVDNDVVVS TTNRNFPGRM GSKKAGVYLA SPLTAAASAI
TGKLTDPRDF L
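
The sequences above are printed in space-separated blocks of ten characters, sixty per line, so they need to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction. The helper names clean_sequence and gc_fraction are hypothetical, and the example input is only the first line of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of ten bases).
first_line = "ATGGCACAAA CAATAACCCA GAAAATCCTT TCAAGGGCGG CAAACCGCAA GTTTGTAGAT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"GC fraction of this line: {gc_fraction(seq):.2f}")
```

Applying clean_sequence to the full gene sequence and rounding gc_fraction to a percentage gives a value that can be compared against the GC content field in the Gene Information section.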