Gene Acid345_3792 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3792 
Symbol 
ID4071076 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4478351 
End bp4479640 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content56% 
IMG OID637985815 
ProductGDP-mannose 6-dehydrogenase 
Protein accessionYP_592866 
Protein GI94970818 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1004] Predicted UDP-glucose 6-dehydrogenase 
TIGRFAM ID[TIGR03026] nucleotide sugar dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATCT CTATTGTGGG TCTGGGATAT GTAGGTGCAG TTTCAGCGGC GTGCCTGGCG 
AGCCGAGGGC ACAATGTCTG GGGCGTGGAT ATCAACGCCG ACAAGGTTGC TGCACTGAAC
GAAGGGAAGT CGCCGATCAT CGAGCCAAAG CTCGCTGAAT TGCTCTCTGC GGCGGTTTCG
AGCGGCAAAC TGCGCGCCAC GTGCGACATG TCCGAAGCTC TCGCACAAAC AGACATTTGC
TTCGTCTCGG TTGCGACGCC GAGCCGGAAG AACGGCCAGA TCGACGCCGG CCACTTGCTG
CGGGCCTGTC AACAGATAGC GGATGTTATC AAGTTGCTAG ATCGCAAACA GGTCATCGTC
ATCCGGAGCA GTGTGCTCCC CAGTGTTTTC GAAGAGGCGA CGAAGCTGTT CGCGACGCAT
GTGCCAGGGT TGGCAGAACT TTGCATCAAT CCGGAGTTTT TGCGTGAAGG CTCGGCGATT
GCCGACTACG AGAACCCACC TTTTACGGTG ATCGGTACTG ACAAGCCGGA GGTAGAGAAA
ATGCTCCGCG ACTTGTACTC CGATCTCAGC GCTCCGGTGT ACGTTCTTGG CGCTAAGGAA
GCGACGTTGG TGAAGTACGC CAGTAATGCC TTCCACGCTC TCAAAGTCGC ATTCGCGAAT
GAAATCGGAG CGGTCTGCCA CGAGAACGGT ATCGACGGTG ACGCTGTAAT GGAAGTGTTT
TGCAAAGACA ATAAGCTGAA CATCTCTCAT CGGTATTTGC GCCCTGGCTT TGCTTTTGGT
GGTTCCTGTC TTCCGAAAGA TGTGCGTGCC CTCCTCTATG CCGGCAAGGT TGCGGACATT
GAGTTGCCAT TGCTGCGCGG CGTGTTGGAT AGCAACGAGT GCGTGATTGA GCGTGCGTTC
GACGCGATTC TCGAGACAGG TGCTCGTCGC ATCGGTCTCG TGGGCTTAAG CTTTAAGAAG
AACACCGACG ATCTCCGCGA AAGCCCATTC GTTGAACTGG CGGAGCGCCT CGTTGGCAAA
GGACTCGCCC TGCGAATCTA TGACCCGAAT GTCTCCGTGG CACGCCTGGT GGGCGCGAAT
CGTGAGTACA TCCAACAGCG ATTGGTGCAT CTTTCCTCTC TGCTCACAGA GTCGATCGAA
GAACTCGCTG TAGCGTGCGA CCTCATCATT ATTGGGCACA AGTTCGAAGG CGTCGAAAAA
CTTCAGGCCC ATCGCGATCG ATATCACATA CTCGACCTGA CCGGTCCATT CGCACTTCAT
CCACGGCAAT CGGCTGCCAA GGCAGGATAG
 
Protein sequence
MNISIVGLGY VGAVSAACLA SRGHNVWGVD INADKVAALN EGKSPIIEPK LAELLSAAVS 
SGKLRATCDM SEALAQTDIC FVSVATPSRK NGQIDAGHLL RACQQIADVI KLLDRKQVIV
IRSSVLPSVF EEATKLFATH VPGLAELCIN PEFLREGSAI ADYENPPFTV IGTDKPEVEK
MLRDLYSDLS APVYVLGAKE ATLVKYASNA FHALKVAFAN EIGAVCHENG IDGDAVMEVF
CKDNKLNISH RYLRPGFAFG GSCLPKDVRA LLYAGKVADI ELPLLRGVLD SNECVIERAF
DAILETGARR IGLVGLSFKK NTDDLRESPF VELAERLVGK GLALRIYDPN VSVARLVGAN
REYIQQRLVH LSSLLTESIE ELAVACDLII IGHKFEGVEK LQAHRDRYHI LDLTGPFALH
PRQSAAKAG