Gene Acid345_1612 details


Gene Information

Locus tag: Acid345_1612
Symbol:
ID: 4072538
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1954275
End bp: 1955270
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 57%
IMG OID: 637983621
Product: tagatose 1,6-diphosphate aldolase
Protein accession: YP_590688
Protein GI: 94968640
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3684] Tagatose-1,6-bisphosphate aldolase
TIGRFAM ID: [TIGR01232] tagatose 1,6-diphosphate aldolase
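
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below shows that relationship, assuming inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length. The function name `cds_lengths` is illustrative and not part of the record.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes inclusive 1-based coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1  # inclusive coordinates
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    protein_length = codons - 1  # the stop codon encodes no amino acid
    return gene_length, protein_length


print(cds_lengths(1954275, 1955270))  # (996, 331)
```

With the coordinates listed for Acid345_1612, this reproduces the reported 996 bp and 331 aa.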


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.697087
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGTTAA CCCCCGGAAA GCTTGCCGGA ATGAAGGCCG TATCGAATGA GCGCGGCGTG 
ATTGCTGCCG CCGCAATGGA CCAGCGCGGT TCGCTGAAAA AGGCGCTGGG CGCGAACGCG
ACCGATCGCA ACCTGGAAGA GTTCAAGGAA ATCGTGACCG AAGTTTTGAC GCAGCATGCG
TCGGCGATTT TGCTTGATCC TGAGTTTGGA TTGAGTGCGG CGAAGCATCG TGCGAAGAAC
TCGGGTTTGC TGCTCGCTTA CGAGAAGACT GGCTACGACA AGCAGACGCC AGGACGCTTG
CCTGATTTGC TTGATGTGTG GTCAGTGCGG CGGATCAAGG AAGCTGGCGG AGATTGCGTG
AAGATCCTGC TGTATTACGC ACCGGCTGAT CCGAAGCGCA TCAACGATCA TAAACACGCA
TGGACAGAGC GCATTGGCGA CGAGTGCCGG GCGAATGACA TTCCCTTCTT CCTCGAGATT
ATCGGCTATG AAGAAGGCAT GGACGAGAAG GGCGTTGATT ACGCCAAGAA GAAGCCGGAA
ATCGTGAAGG CTTACATGAA GGAGTTCTCG AACCCGCGCT ATGGCGTGGA CGTGCTGAAG
CTCGAAGTGC CGATCAATAT GCAATTCGTG GAAGGCACGA AGTCGTTCAA GGGGCAGAAG
GCGTACACGG TTGACGAAGC GAAGGAACAC TTCCGCGACT CGGCGAAGGC GACGAATTTG
CCGTTCATCT ATTTGTCGGC AGGCGTGAGC AATGCGGAGT TCATCGAGAC GCTGGAATTG
GTGTCAGGGA GCGGCGTGAA GTACAACGGC GTGCTCTGCG GACGCGCCAC CTGGAAGGAC
GGGATTCCGA TCTACGCGCA GCACGGCGGC AAAGCCTTCC ATGAATGGAT CAGCACGGAA
GGCGTGCAGA ACATCAATAA CGTGAACAAG GCGCTGGAGT CGGCGAGCTC GTGGTTCCCG
ATTTATGGAG TGGAGAAGGC GGGAGCGGGG CGGTAA
 
Protein sequence
MTLTPGKLAG MKAVSNERGV IAAAAMDQRG SLKKALGANA TDRNLEEFKE IVTEVLTQHA 
SAILLDPEFG LSAAKHRAKN SGLLLAYEKT GYDKQTPGRL PDLLDVWSVR RIKEAGGDCV
KILLYYAPAD PKRINDHKHA WTERIGDECR ANDIPFFLEI IGYEEGMDEK GVDYAKKKPE
IVKAYMKEFS NPRYGVDVLK LEVPINMQFV EGTKSFKGQK AYTVDEAKEH FRDSAKATNL
PFIYLSAGVS NAEFIETLEL VSGSGVKYNG VLCGRATWKD GIPIYAQHGG KAFHEWISTE
GVQNINNVNK ALESASSWFP IYGVEKAGAG R
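
As a rough consistency check between the gene and protein sequences above, the sketch below translates a coding sequence under translation table 11 and computes its GC fraction. It assumes that table 11 uses the standard codon assignments and that an alternative initiator codon (here GTG) is read as methionine. The function names and the start-codon set are illustrative assumptions, not part of the record.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed to apply to table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiator codons assumed for table 11; each is read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing above."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading the first codon as Met if it is a start codon."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    protein = [CODON_TABLE[c] for c in codons]
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"
    if protein and protein[-1] == "*":
        protein.pop()  # drop the terminal stop codon
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above (60 nt, 20 codons).
first_line = "GTGACGTTAA CCCCCGGAAA GCTTGCCGGA ATGAAGGCCG TATCGAATGA GCGCGGCGTG"
print(translate_cds(first_line))  # MTLTPGKLAGMKAVSNERGV
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate_cds` and `gc_fraction` allows a comparison against the complete protein and the reported GC content.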