Gene Namu_3532 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3532 
Symbol 
ID8449151 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3881690 
End bp3882976 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content73% 
IMG OID645042610 
Productglycosyl transferase group 1 
Protein accessionYP_003202846 
Protein GI258653690 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.00173558 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0189831 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGTTG ATTCCTCGCC CCTGGCCCCG ATCGCGGTGC CGATCACGCC GCCGGCGACG 
CTGGACCTGT CCCTGGACTC CCCGGCCGCC GAGCCCCGGC TCAAGATCGT GCTGGTCGAG
TTCCTGCCCA GCGGCGGCAT GTTCCAGTTC ACCTTCCTGT TCGGCGAGGC GCTGGCCCGG
CAGGGCCACG AGGTGCTGCT GCTCACCGGT CCTGACCCCG AACTGCGGTC GAACACCCCG
GGCTTCGAGG TCGTCGAGCT CTTCCCGACC TGGCACCCCA ACGTCGATCC CGGGGGGTCG
GCGCTGCGAC GCAAGGCCCG CCGGCTGGGC CGGGCCGCCC TGCTGGTCGA GTCGTGGCGG
CGGGCCATCG CGTTCTTTCG CCGGGTGCAT CCCGACCTCG CCCAATTCGG CGAGCTGCGT
TATCCGCTGG ACAGCGCGAT GCTGCTGCTG CTGGCCCGGC GCAGTCCGGA GACCGGGCTG
GTCGACGTGG CCCACAACCC GCTGCCCTAC GACGTGAACG GCCGGGCGAC CGCGGTGGAG
AAGACCGGGC GGCTGACCCG TTCGCTGTTG GCCGCGGCCT ACCGGGCCTG CGACCTGATC
CTGGTGCTCG GCGAGGGCCC GCGGACCAGC CTGCTGACGG CGTTCCCGCG GCTGGGCCGG
GTGGCCGTCT GCGGGCACGG GGACTACTCC GCGGTGCTGG CCACCGAGCA GGCACCGCCG
CCGTCGTCGG CGCCGGCGAA TGCCCTGTTC TTCGGGGCCT GGACCAAGTA CAAGAACCTG
CCGCTGCTGC TGGACTCCTT CGAACTGGTC CGCCTGCAGT TGCCGCAGGC CCGGTTGACC
ATCGCCGGTC CGGTCATGCC GGACGTCGAC CTGGAATCGA TCACCCGGCG GGCCGAGCAG
ATCGGGAACG TGGACCTACG CCCCGGGTAC GTCCCGATGG ACGAGGTCGC CGCCCTTTTT
GCGGCTCACC GGACCGTCGT GTTCACCTAC ACGACGGTCA ACATCAGCGG CAGCGTGCAC
ATGGCCTACA CCTTCGGCCG GCCGGTGGTG GCCACCGACG TCGGCTCGAT GCGCGACGCG
GTCGCCGACC ACGTGACCGG CCGGCTGGCC GCGGCTGACC CGGCCGCGGT GGCCGCCGCG
ATGGTCGAGG TCCTGGGTGA CCCGGCCGCG GCCGACCGGA TGGGCGCGCA GGCCCAGCAG
CACGCCCGCA GCAGCGCCTC CTGGGCCTCG GTGGTCGACA AGGCGGTGCC GGCCTACCGC
GCCGCGGTGG CCGCGGTCCG CCGCTGA
 
Protein sequence
MAVDSSPLAP IAVPITPPAT LDLSLDSPAA EPRLKIVLVE FLPSGGMFQF TFLFGEALAR 
QGHEVLLLTG PDPELRSNTP GFEVVELFPT WHPNVDPGGS ALRRKARRLG RAALLVESWR
RAIAFFRRVH PDLAQFGELR YPLDSAMLLL LARRSPETGL VDVAHNPLPY DVNGRATAVE
KTGRLTRSLL AAAYRACDLI LVLGEGPRTS LLTAFPRLGR VAVCGHGDYS AVLATEQAPP
PSSAPANALF FGAWTKYKNL PLLLDSFELV RLQLPQARLT IAGPVMPDVD LESITRRAEQ
IGNVDLRPGY VPMDEVAALF AAHRTVVFTY TTVNISGSVH MAYTFGRPVV ATDVGSMRDA
VADHVTGRLA AADPAAVAAA MVEVLGDPAA ADRMGAQAQQ HARSSASWAS VVDKAVPAYR
AAVAAVRR