Gene Apar_0343 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0343 
Symbol 
ID8413191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp391957 
End bp393345 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content51% 
IMG OID645021910 
ProductUDP-N-acetylglucosamine pyrophosphorylase 
Protein accessionYP_003179365 
Protein GI257784148 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) 
TIGRFAM ID[TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACTTA CTGCGATTGT CCTCGCCGCA GGCGAGGGAA CGCGCATGAA GTCCCACCAC 
CCTAAGATTG TGCACAAATT ACTAGATAAA CCTATTGTTT GGTGGAGTGT CAACGCCGCA
ATTACTGCTG GCGCTGACCG CGTCATTGTA GTTGTTGGTA ACCACGCCGA CGAGGTTAAA
TCTGCACTTT CTTGCTTCCC TAACCTTGAG TATGTTGCCC AGACCGAGCG TCTTGGTACT
GGCCACGCAG TCAAGGTTGT AAAGGATGCT CTTGGTGGCT TCAAGGGACC CGTTGTTGTC
ATTAATGGTG ACGCATCTTT GTTGCGCGCA CAGTCTATTC TCGACCTTGT CGCAGAAACT
AAAGCTCATC ACAACGCGTG TACTGTACTT ACGATGACGC CGCCAGATCC AACTGGATAT
GGTCGCGTTA TTTCTTCTAA CGGCCAGGTA ACTGCCATTA TTGAACATAA AGACGCTACA
CCAGAGCAAC GCGAACAAGA ACGTGAATGC AACGTTGGTG TGTACTGCTT CTGTGGCGGA
AGACTGACCG CAAACATCGA TCTTCTCGGC AACGATAACG TCCAAGGCGA GTACTACATC
ACCGATATGG TGGGCCTCTA TGTAAGCCAA GGTGAGCCTG TTGCTGCCGT CCACGTTGAC
GACTACAAAG AAGCGCTGGG AGTCAACTCC CGCTCCGAGC TTGCTGTTGC TACCCGTATC
ATGCAGGAGC GCATCAACGA GCACTGGATG AGCCAAGGCG TCACCATGCT GGATCCAACC
TCGGTTTGGA TTGGCCCCGA GGTCACGCTT GGCATGGATA CCGAGGTTCT CCCTCAGACC
ATGCTCTATG GCAAAACCTC AATTGGCGAG AATTGCGTGA TAGGTCCTAA CACTCGTCTA
ACCGATACCT GCGTTGGCAA CGACGCTATT GTCGACGAAA CTGTTGCCAT TAACGCACAA
GTTGACGATT ACGCCACTTG TGGTCCTCGC GCTTATCTTC GTCCAGGTAC GCACCTTATG
CCCCACGCTA AAGCCGGCAC ACATGTTGAA ATCAAAAACT CAACTATCGG CGAAGGCTCC
AAGGTTCCTC ACCTCTCTTA CATTGGTGAT ACTACCATGG GCTCTGGCGT CAATATTGGT
GCAGGCTCAA TTACGTGCAA TTACGATGGT TACCACAAGT TTAAGACCCA CATTGGCAAC
AACGTCTTTG TTGGATCCGA TACCATGATG GTAGCACCTG TTTCAATTGG TGATGGCGCA
CTTGTGGGGG CAAGTTCGTG CATTACAAAG GATGTACCAG CAGATGCGCT TGCACTCGAG
CGCTCTGAGC AAAAAATTGT TGAGGGATAC GCCGCTCAGA GGCGTCACAA GCTTGAGAAA
GAGGACTAG
 
Protein sequence
MPLTAIVLAA GEGTRMKSHH PKIVHKLLDK PIVWWSVNAA ITAGADRVIV VVGNHADEVK 
SALSCFPNLE YVAQTERLGT GHAVKVVKDA LGGFKGPVVV INGDASLLRA QSILDLVAET
KAHHNACTVL TMTPPDPTGY GRVISSNGQV TAIIEHKDAT PEQREQEREC NVGVYCFCGG
RLTANIDLLG NDNVQGEYYI TDMVGLYVSQ GEPVAAVHVD DYKEALGVNS RSELAVATRI
MQERINEHWM SQGVTMLDPT SVWIGPEVTL GMDTEVLPQT MLYGKTSIGE NCVIGPNTRL
TDTCVGNDAI VDETVAINAQ VDDYATCGPR AYLRPGTHLM PHAKAGTHVE IKNSTIGEGS
KVPHLSYIGD TTMGSGVNIG AGSITCNYDG YHKFKTHIGN NVFVGSDTMM VAPVSIGDGA
LVGASSCITK DVPADALALE RSEQKIVEGY AAQRRHKLEK ED