Gene Namu_4202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4202 
Symbol 
ID8449828 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4643191 
End bp4644312 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content72% 
IMG OID645043251 
Productglycosyl transferase group 1 
Protein accessionYP_003203480 
Protein GI258654324 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value0.730151 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.251935 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCCGGG TCGGCCTGGA CGGTACTCCG CTGCTCGGGC AGCGGACCGG CATCGGCCGG 
TACACCGAAC ACCTGCTCGC CGCTCTCGTT CGGCGCGGGG ACGTTACGGT GACGGCCACG
GCCTTCACCC TTCGGGGTGC CCGTGGACTG GCCGACGCCG TTCCGGCCGG CGTCCGGGCG
CGGTCGGTTC CTTCGCCCGC GCGGGCGTTG CGCGCTCTGT GGACGCGGTG CGAGGTGCCG
ACGGTCCGGT CGTTCAGCGG CCCGGTCCAG GTGTTCCATG CGACCAATTT CGTGCTGCCG
CCGACCGGGC GGGCTGGGGG CGTGGTCACC ATCCACGACC TCGCCTATCT CACCCGCCCG
GGCACCGTCG ATGGGACTAG CCGGCAGCTG CTTGAGCTGA TGCCGCGAAG CCTGGCGCGG
GCCGCGGTGG TGTGCACCCC GACTCATGCG GTCGCCGCCG CGGTCCGGGA CGCCTACGGG
CCGGTGGTGC AGGACCTGGT AGTCACGCCA TTAGGCGTCG AGGCGGACTG GCTGTCCCTG
AATCCCCCCG GGCTCGATGA GCGGGCCAGG CTCGGGCTGC CCGGCGAGTA CCTGCTGTTC
GTCGGCACTC GGGAACCCCG CAAGGACCTT CGTACCCTGC TGGCCGCCTA TGACCGCTAC
CGGGCGGCCG CACCGGACCC CGCCGACATT CCCGACCTGG TGTTGGTGGG GGCCCGGGGA
TGGGGACCGG ACGAGCGTCC GGGGCCGGGT GTGCTCATTC GGGACTACAC ACCGGCCGAC
GAGCTCAAGA CCATCGTGGC CGGAGCGCGA GCTTTGATCA TGCCGTCACG GGACGAAGGA
TTCGGTCTCC CGGCCCTGGA GGGGTTGGCC GCCGGCGTGG CCGTGATCGT CAGCGACATC
CCGGCCCTGA TAGAGGTCTC GGGCGGGCAT GCAGACGCCT TCCCGATGGG GGACCCGGAC
GCCTTGGCCG ACCTGCTCGG CACGGTCACC GCCCGGGAAC GAGGGCGCAG CGACGCAGAA
CGGGCGAGTG ACCGGCTCCG GCGCCGTCGG TACGCGGCCC GGTGGACCTG GGACCGGTGT
GCCGAGCAGA CCATGAGCGC CTATAGGCGC GCCGCCGGCT GA
 
Protein sequence
MIRVGLDGTP LLGQRTGIGR YTEHLLAALV RRGDVTVTAT AFTLRGARGL ADAVPAGVRA 
RSVPSPARAL RALWTRCEVP TVRSFSGPVQ VFHATNFVLP PTGRAGGVVT IHDLAYLTRP
GTVDGTSRQL LELMPRSLAR AAVVCTPTHA VAAAVRDAYG PVVQDLVVTP LGVEADWLSL
NPPGLDERAR LGLPGEYLLF VGTREPRKDL RTLLAAYDRY RAAAPDPADI PDLVLVGARG
WGPDERPGPG VLIRDYTPAD ELKTIVAGAR ALIMPSRDEG FGLPALEGLA AGVAVIVSDI
PALIEVSGGH ADAFPMGDPD ALADLLGTVT ARERGRSDAE RASDRLRRRR YAARWTWDRC
AEQTMSAYRR AAG