Gene Namu_4816 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4816 
Symbol 
ID8450446 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5362379 
End bp5363740 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content78% 
IMG OID645043855 
Productprotein of unknown function DUF58 
Protein accessionYP_003204080 
Protein GI258654924 
COG category[R] General function prediction only 
COG ID[COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGG CGAGCGGCGG CCGGCCGACC CCGGTCCCGG CCCTGGGCCC GGCGGCGGCG 
GAGGTGCCTC CAGCGGTGCC TCACGCGGTG CCCCCGCCGG TGCCCCCGCC GCCCGGGCCG
GCCCGGGCCC GGCTGGACGG GTCGGCGCTG GCCGCGGCGT GGGCCACCGT CACCCGGCGC
GGCCGGGTGG CGCTGGCGGT GCTGGTCGGC GCCGCCCTGG TCGGCTGGCT GACCGGGTGG
CGCGAATGGA CCGGGCTGGC CGCGGGGCTG GCCGTGGTGA TGCTGGTCGC CGTGGCGATG
GCGCTGGGCC GCTCACCGGT CGCGATCGAT CTGGATCTGG CCCGCACCCG GTTCGTGGTC
GGTGACCCGG CGGTCGCCCG GGTCGGGGTG CGCAACGTCT CCGGCCGGCG GATGCTGCCG
CTGCGGTTGG AGCTGGACGT CGACGGCCTG CCCGCGCAGG TGCGGGTGCC GTCCCTGCCG
GGCGGGGCCG CCCACCCGGT GGTCATCCCG CTGCCCACGC ACCGCCGGGG GGTCATCGAG
CTGGGGCCCG CCCGTGCCGT GCGCGGGGAC GTGTTCGGCC TGCTGCGCCG GGTGGTCCAG
TGGCCGGTGC ACGAGCAGGT GTACGTGCAT CCGCGGACGG TGCAGCTGCC CGACCCGCTG
CCCGGCCGGG CCCGGGACCT GGAGGGGGAG GAGTCGGCCA TCCGCACGGC CAGCGACCTG
TCGTTCCACA CGCTGCGCGA CTACGTGCCG GGCGATGACC GCCGCTTCAT CCACTGGAAG
TCGACCGCCC GCAGCGGCAC GCTGCAGGTC CGCGAGTTCC TGCAGACGCA CCGGTCGCTG
GTCGCGGTGG TATTGGCCGG CAACCCGGAC GACTACCGCG CGGCCGGCTG GTCGCCGGGG
GCCGCCGGCG ACGGGTCCGA CGCGGGCACC TCGCCGGAGT TCGAGGTCGC GGTCAGCTGC
GCGGCCTCGA TCGTGGCCGA GCTGGTCCGG CGCCACCGCG ACGTCGTGGT CGACGCGGCC
GGTTCGGCGA TCCGCGCCGC CTCGGACCAG GGTGTGTTGG ACCGGTTCAG CCCGGTGCGC
ACCGTCGCCG GTTCGCCCGA CCTGCTGGCC ATGACCCGGC AGGTGGCCCG CCGGCACCCG
CGCACCTCCC TGGTGGTGCT GGTGTTCGGG TCCACCGTCG AGCCGGCCCG GCTGCGCGCG
GCCGCCCGGC TGGGCCCGAC CGGCGCGACC GTGCTGGCGG TGCGGGCCCG GGTGCCGGAC
TCACCGCACC CGGCCACGCT GGCGCCGCTG AGCACGGGGG CCGTCGTCAC CGTCGAGGAC
GTGGCCCAGC TACCGCTGGC CCTGCGCGGG GTCCGGCGAT GA
 
Protein sequence
MSEASGGRPT PVPALGPAAA EVPPAVPHAV PPPVPPPPGP ARARLDGSAL AAAWATVTRR 
GRVALAVLVG AALVGWLTGW REWTGLAAGL AVVMLVAVAM ALGRSPVAID LDLARTRFVV
GDPAVARVGV RNVSGRRMLP LRLELDVDGL PAQVRVPSLP GGAAHPVVIP LPTHRRGVIE
LGPARAVRGD VFGLLRRVVQ WPVHEQVYVH PRTVQLPDPL PGRARDLEGE ESAIRTASDL
SFHTLRDYVP GDDRRFIHWK STARSGTLQV REFLQTHRSL VAVVLAGNPD DYRAAGWSPG
AAGDGSDAGT SPEFEVAVSC AASIVAELVR RHRDVVVDAA GSAIRAASDQ GVLDRFSPVR
TVAGSPDLLA MTRQVARRHP RTSLVVLVFG STVEPARLRA AARLGPTGAT VLAVRARVPD
SPHPATLAPL STGAVVTVED VAQLPLALRG VRR