Gene Namu_3365 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3365 
Symbol 
ID8448980 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3703243 
End bp3704385 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content74% 
IMG OID645042442 
Productglycosyl transferase group 1 
Protein accessionYP_003202682 
Protein GI258653526 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.00178378 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0354182 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGGCCCA ACGCTTCCGC GCGAACCAAC GCGCCGACGT CAAGTTTCAA GATCCTGGTC 
ATCGCCTCGC TCCGGTATCC GATCGCGCAG CCGTTCGCCG GCGGGCTCGA GGCGCACACC
TGGTCGCTGG CCACGGGGTT GCGGGCCCGC GGGCACTCCG TGCTGGTGGC CGGGGCCACC
GGCAGCGACC CGAGTGTGGT CGGCTACGAA TTCGGCCGCC TGCCCACCGG CGACGGGTCC
GAACGCGCCG ACATCACCAA TCATCCGGAC GTCAGCCGGG CCGAACGGCA GGGCTTCACC
GACCTGATCG CGCAGGTCCG GGACGGCCTG CTGGGCTCGT TCGACCTGAT CCACAACAAC
GCCCTGCACC CGTACCCGGT CGAGCAGGCG CACACCCTGG ACGTCCCGAT GGTCACCACC
CTGCACACGC CGGTGCTGCC CTGGGCGCAG CGGGTGCTGG GGGAATCGGC CGTGCCCCAG
CACAGCCAGC ACTTCGTGGC CGTCAGCCGG GCCACCGCCG ACGCCTGGCG CCCGCTGATC
CGGCCCCAGG TCGTCCGCAA CGGGGTGGAC ACCGACCTGT GGCGTCCGGG TCCCGGCGGG
CCCGGTGCGG TCTGGTCCGG GCGCATCGCC GCGGAGAAGG CCCCGCACCT GGCCATCGAC
CTGGCCCGGG CGGCCGGGAT CGAGCTGACC ATCGCCGGCC CGATCGTCGA CGAGCCCTAC
TACGCCGCCG CGGTCGCGCC CCGCTTGGGA CCGGGCGTCC GCTACGCCGG CCACCTGGAT
CAACAGCGCC TGGCCGAGCT GGTCGGGCAC AGTGCGCTCG CGCTGGTCAC CCCGGTCTGG
AACGAGCCGT TCGGGCTGGT CGCGGTCGAG GCGATGGCCT GCGGGACGCC GGTCGTCGCG
CTGGCCCGCG GCGGCCTGCC GGAGATCGTG GACCGCCGGT CCGGACGGCT GATCCCGCCC
ACCGAGGCCA CCGGGTTCGC CCCCGACGAC CTGGCCGCGG CGGTCCGGGC GATGGCGCAG
GCGGCCACCC TGGATCGCGG CGCGGTCCGG CAGCGGGCGC TGGCCCGGGG CAGCGCGGCG
GCGATGATCC GCGGCTACGA GCAGGTCTAT CAGCGGGCCG TTCGGCGCTG GACCCGGTCG
TGA
 
Protein sequence
MRPNASARTN APTSSFKILV IASLRYPIAQ PFAGGLEAHT WSLATGLRAR GHSVLVAGAT 
GSDPSVVGYE FGRLPTGDGS ERADITNHPD VSRAERQGFT DLIAQVRDGL LGSFDLIHNN
ALHPYPVEQA HTLDVPMVTT LHTPVLPWAQ RVLGESAVPQ HSQHFVAVSR ATADAWRPLI
RPQVVRNGVD TDLWRPGPGG PGAVWSGRIA AEKAPHLAID LARAAGIELT IAGPIVDEPY
YAAAVAPRLG PGVRYAGHLD QQRLAELVGH SALALVTPVW NEPFGLVAVE AMACGTPVVA
LARGGLPEIV DRRSGRLIPP TEATGFAPDD LAAAVRAMAQ AATLDRGAVR QRALARGSAA
AMIRGYEQVY QRAVRRWTRS