Gene Namu_4553 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4553 
Symbol 
ID8450181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5066306 
End bp5067421 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content73% 
IMG OID645043594 
Producthypothetical protein 
Protein accessionYP_003203821 
Protein GI258654665 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCACCG GGCCCGTGCT ACCCGGGGTG CTGGCCGCCG TGCTCGTCGC CGGTCTGACG 
GCCTGCGGCT CGTCGCCGTC CGCGGCGTCC TCGTCCGCCC CGGCGGTCTC GGCGGCGGCG
ACGGCGGTCT CGGCGGTCTC GGCCTCCGCG GGGGCCACGG CGGACCACCG CGCGGTGGTC
ATCGTCTCCG GCGGCGACGC GGTGAGCCCG TTCACCACCC CGACCCAGGC CTGCACCACC
GGCCTGGCTG CCGGCAACAC CGACACCGCC CTGCGGGCGG CGTTGCTGGC CGACGCTCAG
CAGGTCTACA CGGCGCCGGC CATGAATGCT CGCAGTGCCG TGATCGAGCC GGACCCGACC
TCCTTCGGCG CGTTCGGCGA CTGCCCGGCC CCGCTGCCGG CCTTCATGAC GATCCTGTCC
AACGGCGACA TCGACAACGG GGGCGAGCAT CTGGCCCACT TCGTCAACTA CCTGCACGAC
ACCGAGGGCG TGACCGAGAT CGACTGGGTG GGTCACTCGA ACGGCGGGCT GTGGGCCCGG
GCCGCGACCC GCATCCTGCG GGACACGGGC AGCCCGGTGC GGGTCGGCTC GCTGACCACC
GTCGGCACTC CGTGGGAGGG GGCCGTCCCG TTCCGGATCG TGTTCGGCGA GTTGCCCGAA
TCCACCTGCC TGGACAACGC CGTGTGCCTG AACCTGGTGG CCGTCATGCG GGAGGAGGCC
AAGGGCGACC TGGGGCTGGG CCGGCAGCAG CTGGCGAGCT ACCTGCTCGG CGACGGCGGC
TGGAACGCGG CCCAGCTGGG CGTGCTCGAC ACCATTCCCG TCCACCTGAT CGGCGGTGGC
TACCTCACCG AACCGGCCGG CGATCCGCAG ATCTGGCCGT TCGACGGGCT GGTCTCGCAG
TACTCGGCCA CCGCCCAGGG GCTGCCGGAG CAGACCGCCC CGCTGCGCAC CTGTAGCCGC
TACCCGCTGA CCCACAGCAT CTACATCTCC CTCGAGCTCG GCCTGGACTG GCAGACCGCG
CTGACCTGGA ACGACGACGT GATGGCCGAC GTGACCGGCT TCGTGCGGTC CGTGCAGCAG
GGCGGCCCGC CCACCGGCGA GCCCTGCACC TCCTGA
 
Protein sequence
MSTGPVLPGV LAAVLVAGLT ACGSSPSAAS SSAPAVSAAA TAVSAVSASA GATADHRAVV 
IVSGGDAVSP FTTPTQACTT GLAAGNTDTA LRAALLADAQ QVYTAPAMNA RSAVIEPDPT
SFGAFGDCPA PLPAFMTILS NGDIDNGGEH LAHFVNYLHD TEGVTEIDWV GHSNGGLWAR
AATRILRDTG SPVRVGSLTT VGTPWEGAVP FRIVFGELPE STCLDNAVCL NLVAVMREEA
KGDLGLGRQQ LASYLLGDGG WNAAQLGVLD TIPVHLIGGG YLTEPAGDPQ IWPFDGLVSQ
YSATAQGLPE QTAPLRTCSR YPLTHSIYIS LELGLDWQTA LTWNDDVMAD VTGFVRSVQQ
GGPPTGEPCT S