Gene Namu_4557 details

Gene Information

Locus tag: Namu_4557
Symbol:
ID: 8450185
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 5071771
End bp: 5072970
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 72%
IMG OID: 645043598
Product: Transglycosylase domain protein
Protein accession: YP_003203825
Protein GI: 258654669
COG category: [S] Function unknown
COG ID: [COG3583] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
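
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon (both assumptions are consistent with the values in this record; the function names are illustrative and not part of any database API):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


length = gene_length(5071771, 5072970)
print(length)                  # 1200
print(protein_length(length))  # 399
```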


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCTTAC AAATAGACAA CGTTGACGCC CCGTCCGGGC CGAACGAGCA GCCGGCCCGC 
CGGGGCCTGC GCCGCAAGCT GGTGCTGGTC GGCGTGGCCG CCGCGCTCGG TGTCGTGGCT
GTCGGTGGCG CCACCGCCGC CGCCATGTCC AAGCACGTCG TCATCACCGT CGACGGCCAG
GACCAGCAGG TCACCACGCT GTCCGGCTCG GTGGCCGGTG CGCTGTCCGC CGCGGGCCTG
TCCGCCGGTG AGCACGACGT GCTCGCGCCG GCCGCCGACA CCGCCATCTC CGACGGTTCG
CACATCGCCC TGGAGCGCGC GCGGCTGTTG ACCCTGACGG TCAACGGGAC CACGCAGCAG
CTGTGGACGA CCGCCGACAC CGTCGAGGAG GCCCTGCTGC AGCTCGGCCA GGATCCGTCG
GCCTACCAGC TGTCCGCGGA CCGGTCCCGG GAGATCCCGC TGGACGGTCT GGACCTGACC
GCCTCCACCC TGCACACCGT CAGCCTGGCC GTCGGCGGGG CTCCGGCCAC CACCGTCCAG
TCCGGCGGAC AGACCGTCGC CGACGTGCTG GCCGCCCAGG GCATCACCCT GGCCGCGACC
GACACCGTCG ACCCGGCCGG CACCACCCCG GTCACCGACG GCACCGCGAT CACCGTGACC
CGGGTCGCCG TCACCACCAC CACCGACACC GTCGCGGTCG CGCCGGCCGA CCAGACCGTC
GAGGATCCCA ACCTGGACAA GGGCACCACC CAGGTCGTCG CCGCGGGTAC CCCCGGCCAG
CAGCAGGTCG TCACCCAGGT CACCACCACC AACGGGGTGG AGACCGGCCG TCAGGAGCTG
TCCCGCACCA CGGTGCTCGA GGCCACCCCC AACCAGGTGC ATGTCGGCAC CAAGTCCACC
CTGGACTGGC AGGGCAGCCG GGTGTTCTTC CACGACACCG AGTTCGGCGT GAACTGGGAC
GGTCTGGCCT ACTGCGAGTC GACCAACAAC CCGCACGCGG TCAACAACCC GGCCGGCTAC
CTGTCGACCT ACGGCCTGTT CCAGTTCGAC CTGCCCACCT GGGCCTCGGT CGGCGGCTCG
GGCAACCCCG GGGATGCCTC CCCGGAGGAG CAGTTGACGC GGGCCAAGTT GCTCTACCAG
TCCCGTGGGC TGGAGCCGTG GCTCTGCGGC TACGCCGCCA GCGGCCCGCC CGCCGGCTGA
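
The GC content reported for this gene is the fraction of G and C bases in the gene sequence. A minimal sketch of how that fraction could be computed from a sequence printed in space-separated blocks of ten, as above; the function name `gc_content` is hypothetical, and only the first block of the sequence is used here for illustration:

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a DNA sequence.

    Whitespace (spaces between blocks, line breaks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# First 10-base block of the gene sequence above.
first_block = "GTGACCTTAC"
print(gc_content(first_block))  # 0.5
```

Passing the full gene sequence to the same function gives the value for the whole gene.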
 
Protein sequence
MTLQIDNVDA PSGPNEQPAR RGLRRKLVLV GVAAALGVVA VGGATAAAMS KHVVITVDGQ 
DQQVTTLSGS VAGALSAAGL SAGEHDVLAP AADTAISDGS HIALERARLL TLTVNGTTQQ
LWTTADTVEE ALLQLGQDPS AYQLSADRSR EIPLDGLDLT ASTLHTVSLA VGGAPATTVQ
SGGQTVADVL AAQGITLAAT DTVDPAGTTP VTDGTAITVT RVAVTTTTDT VAVAPADQTV
EDPNLDKGTT QVVAAGTPGQ QQVVTQVTTT NGVETGRQEL SRTTVLEATP NQVHVGTKST
LDWQGSRVFF HDTEFGVNWD GLAYCESTNN PHAVNNPAGY LSTYGLFQFD LPTWASVGGS
GNPGDASPEE QLTRAKLLYQ SRGLEPWLCG YAASGPPAG
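
The protein sequence corresponds to the gene sequence translated with translation table 11 (the bacterial, archaeal and plant plastid code). The gene starts with GTG, which table 11 allows as an initiation codon, so the first residue is M rather than V. A minimal sketch of that translation, assuming the standard NCBI table 11 codon assignments; `translate_cds` is an illustrative name, and stopping at the first stop codon is a simplification chosen for this sketch:

```python
from itertools import product

# NCBI translation table 11 amino acids, in TCAG codon order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Codons that table 11 permits as initiators; each is read as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    if len(bases) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(bases), 3):
        codon = bases[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons initiate with Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("GTGACCTTAC AAATAGACAA CGTTGACGCC"))  # MTLQIDNVDA
```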