Gene Namu_4445 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4445 
Symbol 
ID8450072 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4930069 
End bp4931334 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content70% 
IMG OID645043492 
ProductProtein of unknown function DUF1972 
Protein accessionYP_003203720 
Protein GI258654564 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.733886 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCATGA AGTCGAAGCA GGCCCGCCCG CTGCGGATCG CGCTGGTGGG AACCCGCGGG 
GTGCCGGCCC GGTACGGCGG GTTCGAGACC TGTGTCGAGG AGGTCGGCTC CCGGCTCGTC
GAGCGTGGCC ATGAGGTGGT CGTCTACTGC CGGCGCCGCG GCTCGGATCG CAGCGCGGAG
CTGGACAGCT ACAAGGGCAT GTCCCTGGTG CACTTCGGGG CCCTGAAGAA GCGCTCGCTG
GAGACCTTGA GCCACACGGC GCTTTCGGTG CAGCACCTGG TCCGCCATCG GACCGACGCG
GCGGTGGTCT TCAACGCGGC CAACGCGCCC TTCCTGCCGG CTCTGCGGGC GGCCCGGATC
CCCGTGGCCA CCCACGTGGA CGGGCTGGAA TGGAAGCGCG ACAAGTGGGG CGGCGCCGGC
CGTCGTTACT ACCTGATGGC CGAGCGACTG GCGGTCAAGT GGTCGGACGC GCTGATCGCC
GACGCGGTCG GCATCCAGGA CTACTACCTG GACAAGTTCG CCATGCCCAC CGATCTGATC
ACCTACGGCG CCCCGATCCT GGACACCGTC GGCGACCACC GGCTGGCCGA GCTCGGCCTG
ACCTCGGGCG GATACCACCT GGTGGTGGCC CGATTCGAGC CGGAGAACCA CGTGGACATG
ATCGTGGAGG GCTACTCGGC CAGTGCGGCC GAGCTCCCGC TGATCGTCGT CGGGTCCGCG
CCGTACGCGG ACGCCTACAC CCAGCGAGTG CACGAACTGG CCGATGGCCG GGTGCGGTTC
CTCGGTGGGG TGTGGGACCA GCAGCTGCTG GACCAGCTCT ACGCCAACGC CTTCACCTAC
CTGCACGGGC ATTCGGTGGG TGGGACCAAC CCCTCGCTGC TGCGGGCACT CGGGGCCTCG
GCCGCGACCA CTGCGTTCGA CGTCAACTTC AACCGTGAGG TGCTGGGCGG GGCCGGTCGG
TTCTTCTCCG ACGTGGCCGG GGTTCGCGCG CAGATCGAGG CCTCCGAACT GGACATCGCC
AGCACCGTCG AACTGGGTAC CCAGGCTCGG ATCCAGGCGA CCAAGTACGA CTGGGACGAT
GTGACCGACC GCTACGAGGA CCTGTGCCTT CGCCTGGCCG GACGCGATCG GGCGTTGGCC
GGTCCTCGGA CCGACGCGGT GGCCGCGGCC CCGCTCGATG CCTGGCTGGC GGAGATCGGC
ATCGCCGGGT CGGCGGAGCC GGTGCGCGTG CGGACCGGCT CGGAGCCCTC GCTCCGATCG
GCATGA
 
Protein sequence
MIMKSKQARP LRIALVGTRG VPARYGGFET CVEEVGSRLV ERGHEVVVYC RRRGSDRSAE 
LDSYKGMSLV HFGALKKRSL ETLSHTALSV QHLVRHRTDA AVVFNAANAP FLPALRAARI
PVATHVDGLE WKRDKWGGAG RRYYLMAERL AVKWSDALIA DAVGIQDYYL DKFAMPTDLI
TYGAPILDTV GDHRLAELGL TSGGYHLVVA RFEPENHVDM IVEGYSASAA ELPLIVVGSA
PYADAYTQRV HELADGRVRF LGGVWDQQLL DQLYANAFTY LHGHSVGGTN PSLLRALGAS
AATTAFDVNF NREVLGGAGR FFSDVAGVRA QIEASELDIA STVELGTQAR IQATKYDWDD
VTDRYEDLCL RLAGRDRALA GPRTDAVAAA PLDAWLAEIG IAGSAEPVRV RTGSEPSLRS
A