Gene Acid345_4496 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4496 
Symbol 
ID4070174 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5336624 
End bp5337532 
Gene Length909 bp 
Protein Length302 aa 
Translation table11 
GC content61% 
IMG OID637986535 
Productdihydropteroate synthase 
Protein accessionYP_593570 
Protein GI94971522 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0294] Dihydropteroate synthase and related enzymes 
TIGRFAM ID[TIGR01496] dihydropteroate synthase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.470248 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.549745 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATCAGT GGAATGTCGG CAGCCGCATT GTCGAGTTGG GTAAACGCAC GCTCGTTATG 
GCCGTGCTCA ACGTGACGCC CGATTCGTTC AGCGATGGCG GGCAGTTCCC GTCGGCGGAA
GCGGCGGTCG AGGCCGGGCT GCAGATGATC GCGGACGGCG CGGACATTCT GGACATTGGT
GGAGAGTCGA CGCGGCCGGG AGTGAAGGTG GGATCGGACG CCGTGGTCTC AGCCGAGGAG
GAACTACGTC GAGTGCTTCC GGTGGTTGAG GGTCTGAAGA AAGCGCGACC GGAGATATTG
ATCAGCGCCG ATACGTATAA AGCCGAGGTG GCACGGCGCA CGGTGACCGC TGGGGCGGAG
ATCGTCAACG ACGTGAGCGG GATGAGTTGG GATCCGCAGA TGATAGGAAC CGTTGCGGAA
TTGGCTTGCG GCGTGGTGGT GATGCATTCA CGAGGTACGC CAGAGGACTG GCGAGATTTG
CCGGCGGAGC CGCGGATCGT GCAGGTCGTG CAGCAGGGCT TACATGAAGC GACACAGAAG
GCCATTGCAG CCGGAGTGCA CCACGAAAGG ATTATCGTGG ATCCGGGCTT CGGATTTGGG
AAGCGGCTGG AGAACAACTA TCCGCTGCTG GCGAAGTTGG AGGACCTCCA CAGCATTGGG
TTCCCGCTGA TGGTGGGAGT CTCTCGCAAG AGTTTCCTGG CGAAGACGGC GGGGGCGAAG
CTCGCCAGCG AGTTGAAGCC GCTGGAGAGA TTGCATCCGT CGATTGCGGC GGCGGTGATT
GCCGCGATGA AGGGCGCGCA TATCGTGCGG GTGCATGATG TGCGACCGAC GGTTGAGGCG
ATGGCGATCG TGGATGCGGT GATGCTGAGC AGCGAGGACA TGAATCCGTG GTTTGAAGCC
TATTCGTAG
 
Protein sequence
MYQWNVGSRI VELGKRTLVM AVLNVTPDSF SDGGQFPSAE AAVEAGLQMI ADGADILDIG 
GESTRPGVKV GSDAVVSAEE ELRRVLPVVE GLKKARPEIL ISADTYKAEV ARRTVTAGAE
IVNDVSGMSW DPQMIGTVAE LACGVVVMHS RGTPEDWRDL PAEPRIVQVV QQGLHEATQK
AIAAGVHHER IIVDPGFGFG KRLENNYPLL AKLEDLHSIG FPLMVGVSRK SFLAKTAGAK
LASELKPLER LHPSIAAAVI AAMKGAHIVR VHDVRPTVEA MAIVDAVMLS SEDMNPWFEA
YS