Gene Acid345_1025 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1025 
Symbol 
ID4069849 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1289770 
End bp1290912 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content57% 
IMG OID637983032 
ProductUDP-N-acetylglucosamine 2-epimerase 
Protein accessionYP_590102 
Protein GI94968054 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0381] UDP-N-acetylglucosamine 2-epimerase 
TIGRFAM ID[TIGR00236] UDP-N-acetylglucosamine 2-epimerase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.906065 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.822135 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACTTCC TGCACGTAGT GGGTGCGCGT CCCAACTTTA TGAAAGCAGC TCCGCTGATT 
CGCGCCCTCG AACAGCGCGG CTCCCGTCAA ACTCTCGTTC ACTCCGGTCA GCACTACGAC
CGCAACATGT CGACGGTCTT CTTCGATCAG CTAGGCATTC GCAAGCCCGA CGTCAATCTG
CAGGTCGGGA GCGGCAGCCA CGCGCAACAA ACCGCCGCCA TCATGAGCCG CGTCGAGCCC
GTGCTTCTCA ACCAACGTCC TGACGCTGTC ATCGTTTACG GCGACATCAA TTCCACGGTC
GCGGTCGCCC TCGTCTGCGC GAAACTTGGC ATCAAGCTCA TCCACGTCGA AGCCGGCTTA
CGCTCTTTCG ACCGCTCCAT GCCCGAGGAA ATCAATCGCC TCGTCACCGA TCAACTCGCC
GACGTCCTTT TCACGCCCTC ACTCGATGGC GATGAGAACC TGCATCGCGA AGGCATTCCC
GACAACAAAG TCCACTTCGT CGGTAACATC ATGATCGACA CCCTGGTGCG CCTTCTTCCC
CTCGCAGAAC TTCGCTTCGC CGACCTCGCC GCAAAATTCA ATCTCATAAA GTTCGGCCTC
GTCACGCTTC ATCGTCCATC TAACGTGGAC GACATCTCCC ATCTCGCACC GTTGCTCTTC
GCCCTCGATC GCATCGCAGA AGATCTCCCG CTGCTCTTCC CCGTTCATCC TCGAACCTTG
CAACACATGC AGGAGTTCAG CATCAACCTT CATCATCTCC AGATACTCGA GCCACTCCCG
TACATTGACT TTCTCTCCCT CCAGCAACGC GCCGCACTGG TCATCACCGA CTCCGGCGGC
ATCCAGGAAG AAACCACGTA TCTCGGCATT CCCTGCCTGA CAGTTCGCGA AAACACCGAG
CGACCCGTGA CCGTCACGCT CGGCACCAAC CTTCTTGTCG GTTCTGATTT CCATCGCATG
GAATCCGAAG CGCGCAAAGT CATCGCCGGT AACAAAAAGT GTGGGTCCAT TCCGCCACTT
TGGGACGGCC ACACCTCGGA CCGAATCGCC TCCATTCTCA TCAACTGTGG AAGTTTCCCC
GGCGACCCGA ACGATGTAAA AAACTCCCAC TCGTTACTTG TGGACGCGTC CGTAAGTGCT
TGA
 
Protein sequence
MHFLHVVGAR PNFMKAAPLI RALEQRGSRQ TLVHSGQHYD RNMSTVFFDQ LGIRKPDVNL 
QVGSGSHAQQ TAAIMSRVEP VLLNQRPDAV IVYGDINSTV AVALVCAKLG IKLIHVEAGL
RSFDRSMPEE INRLVTDQLA DVLFTPSLDG DENLHREGIP DNKVHFVGNI MIDTLVRLLP
LAELRFADLA AKFNLIKFGL VTLHRPSNVD DISHLAPLLF ALDRIAEDLP LLFPVHPRTL
QHMQEFSINL HHLQILEPLP YIDFLSLQQR AALVITDSGG IQEETTYLGI PCLTVRENTE
RPVTVTLGTN LLVGSDFHRM ESEARKVIAG NKKCGSIPPL WDGHTSDRIA SILINCGSFP
GDPNDVKNSH SLLVDASVSA