Gene Namu_4454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4454 
Symbol 
ID8450081 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4942002 
End bp4943162 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content67% 
IMG OID645043501 
Productglycosyl transferase group 1 
Protein accessionYP_003203729 
Protein GI258654573 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.455752 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTCCCG TCGGGTCCGG CAGCGGCCGT CGCGCGCTGG TCGTGCATCC GGGCGCCGAC 
TTGTACGGAT CCGATCGGGT GTTGCTCGAG ACGGTGGGCG CGCTGGTGGA GGCGGGCTGG
GCGGTGACGG TCAGTGTTCC GGCGGCCGGC CCATTGGTTG CCGTCCTGGT CCATCGGGGG
GCCAGCGTCC AGGTGTGTCC GACGCCGGTC CTGCGCAAGA GCGTCCTGAG CCCCCGTGGT
GCGTTGCGCC TGGTCGGCGA CACGGCCAGA GCAATCCCGT CAGGAGCGGC GTTGATTCGT
CGCTGCCAGC CAGACGTGGT GATCGCCAAC ACGATCACCA TCCCGTTGTG GACATTGTTG
GGGCGGATGT TGCGCCGGCC GGTCCTGGTC CATGTGCATG AGGCCGAAGG TTCGGTTTCC
ATGGTTCGGC AACGGGTGAT GGCCGCGCCC CTGCTGCTGG CCACTGCGCT GGTGGCCAAC
AGTCGATGGA CTCGGGACGT GCTGACCAGG TCGTTCGCGC GCCTGGGACC GCGGACCTCG
GTCATTTACA ACGGAGTGGC CGGGCCCGCC TCTCCGGTGC CACCTCGACC GGCAATCCAA
GGATCGGCGA GGTTGCTGTT TGTCGGCCGA CTTTCCCCCC GAAAGGGTCC CGCCATCGCG
ATCCAGGCAT TGGCGCACCT GCGCCGTCGA GGAACGCCGG CCAGTCTGGA TGTGGTCGGT
GATTCGTTTG CCGGCTATGA ATGGTTCGCT GACGAACTGA GGCGGTTGGT GCAGCTGGAG
GGAGTGGCAG ACGCTGTGCG TTTTCATGGC TTCGTGTCGG ATATCTGGCG GCAAATGGCG
CAGGCCGACG TCGTGCTGGT TCCGTCGCAG GCCGACGAGT CTTTCGGCAA CAGTGCGATC
GAGGCCGTCC TGGGCGCCCG TCCGCTGGTG GTGACCCAGA TCCAGGGGCT ACTCGAGGCA
ACCGAGGGAT TCGCGGCCGT GAAGTCGGTG CCTCCCGGTG ATGCCGATGC CCTGGCCGGC
GGGATCGACG AGATCCTTTC CGAATGGTCG AGATTCGCCG AGCTGGCCGA ACGCGACGCC
CGGATCGCCG TCGAGCGTTT CGCACCGGCC CGCTACCGCC GCGACATGCT GGCCCGGGTT
GCCGGACTGG TCCGACCATG A
 
Protein sequence
MPPVGSGSGR RALVVHPGAD LYGSDRVLLE TVGALVEAGW AVTVSVPAAG PLVAVLVHRG 
ASVQVCPTPV LRKSVLSPRG ALRLVGDTAR AIPSGAALIR RCQPDVVIAN TITIPLWTLL
GRMLRRPVLV HVHEAEGSVS MVRQRVMAAP LLLATALVAN SRWTRDVLTR SFARLGPRTS
VIYNGVAGPA SPVPPRPAIQ GSARLLFVGR LSPRKGPAIA IQALAHLRRR GTPASLDVVG
DSFAGYEWFA DELRRLVQLE GVADAVRFHG FVSDIWRQMA QADVVLVPSQ ADESFGNSAI
EAVLGARPLV VTQIQGLLEA TEGFAAVKSV PPGDADALAG GIDEILSEWS RFAELAERDA
RIAVERFAPA RYRRDMLARV AGLVRP