Gene Acid345_4610 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4610 
Symbol 
ID4070767 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5463282 
End bp5464457 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content61% 
IMG OID637986650 
Productputative mannose-1-phosphate guanyltransferase 
Protein accessionYP_593684 
Protein GI94971636 
COG category[J] Translation, ribosomal structure and biogenesis
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1208] Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.157776 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.623431 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGTAA GGGCCATACT CGTCGTCGGC AACACCGCGC CTGCCAATAC CAAAGCGCCG 
CGCACTGAGT CCTTCGCGGG ATTACCGCTC GCCCTCTACG ATGTGCTCGG CAAGTCGGTC
CTCGCACGCA CTGTCCAGCG CCTGCAGCAT TTCGGCATCA CGCAGTTCAC TACCGTGTGC
GATGAGAAGA CTGGCGGCGC GCTTGACGCT CGCGTGAAAC AGGAATGGAA CGTTGTCGTC
GAAGATCACG GCCTCTGGCG CTCTGCCGAG TCGGTCTTCA ATGACTTCGT GCAGCAGGGC
GCCGAACTGG TGCTCGTTCT GCGCATCGGC GCCTACGCCG AACTCGATTA CGAAGAATTC
GTCCAGTTCC ATCTCGAACA GCACGGGCGC GCTACCGTCG CCGTGGACGC TCGCGGTTAC
CGTATCGGCA TTGTCGCGCT CTCTGCTTCA CGTCGCAACG ACGCCGCCTA CCTTTTCCGT
CACAACATGG AAGAGTTCCG GGTGCCCTGC GTGCAATATC AGTTCCGCGG CTATCTCAAC
CGCCTCATCA CGCCTGCTGA CCTTCGCCAG CTCAGCGTTG ACGCGCTTCT CCAGAACAAC
CACATCCAGC CCATCGGCCG CGAGATTCGG CCTGGCGTCT GGGCAGGCCC ACGCGCGCGC
ATTCATCCCC GCGCCCGCAT CGTCGCCCCG GCGTACATCG GAGAGCGAGC CAAGCTCCGC
GCCTCCGCTG TCATCACCCG CTTCACCTGC ATCGAACATC ATGCCGTCGT GGATTGCGGC
ACGGTGCTTG AGAACACTTC GATCGCGCCC TATACCTACG TCGGCGCCGG GCTCGATATC
AATTACGCCA TCGTCGGCCG CAACCGTATC GCCAGTTTGC GTCGCAATGT AGAGGTCGAA
ATTAAAGACC CGAGGTTGCT GTCGACACTC GCAACCAGTG CTCCAGTCCG CGCCCTGCGC
GAGATGGGGG CCCTCGCTGC ATATTTGCCC CAACAGTTGT TCCGCGGCAT GTTCTCGCGG
TCCCAGCGCA CCATGCCGAC GGACATCCCG GCCGCAGTCA ATACTCCTTC CGCAGCGTTG
AAGGAGCCGG CCCTAACGAA CCCAGGTGGG AGCATCAACT CAACACCAGA GTTCCCCTCC
GAATTAGTGG TAGTGAGGAG ATATGGAAAC GAGTAG
 
Protein sequence
MDVRAILVVG NTAPANTKAP RTESFAGLPL ALYDVLGKSV LARTVQRLQH FGITQFTTVC 
DEKTGGALDA RVKQEWNVVV EDHGLWRSAE SVFNDFVQQG AELVLVLRIG AYAELDYEEF
VQFHLEQHGR ATVAVDARGY RIGIVALSAS RRNDAAYLFR HNMEEFRVPC VQYQFRGYLN
RLITPADLRQ LSVDALLQNN HIQPIGREIR PGVWAGPRAR IHPRARIVAP AYIGERAKLR
ASAVITRFTC IEHHAVVDCG TVLENTSIAP YTYVGAGLDI NYAIVGRNRI ASLRRNVEVE
IKDPRLLSTL ATSAPVRALR EMGALAAYLP QQLFRGMFSR SQRTMPTDIP AAVNTPSAAL
KEPALTNPGG SINSTPEFPS ELVVVRRYGN E