Gene Namu_4571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4571 
Symbol 
ID8450199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5090416 
End bp5092053 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content69% 
IMG OID645043612 
Productprotein of unknown function DUF404 
Protein accessionYP_003203839 
Protein GI258654683 
COG category[S] Function unknown 
COG ID[COG2308] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.903579 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGATC TGTTCGAGGA TTACCCGTTC GCTCGGGCCT GGGATGAAAT GTTCGCGGCA 
CCCGGAGAGA TCCGGCCCGC GTACGAATCG GTGTTCGCCG CGCTGCAGAC CATGGACGCG
GCCGACCTCA AGGCCCGAGC CGACATCATG GGCCGCACCT TCCTGGACCA GGGCATCACC
TTCGCCCTGG GCGGGGTGGA GCGGCCGTTC CCGCTGGACC TGATTCCGCG GATCGTGACC
GCAGCCGAGT GGCAGACGGT GGAAAAGGGC GTCCCGCAGC GGGTTCGGGC ACTGGAGGCG
TTCCTGGCCG ACGTCTACGG GCAGGGCCGG ATCTTCACCG ACGGCGTCGT GCCCAAGCGG
CTGGTCACCA CCTCCCCGCA CTTCCACCGG CAGGTCATGG GCATGAGCGC CCAGGACGGC
GCCCGGGTGG TGATCTCCGG GGTCGACCTG ATCCGGGACG AGAAGGGCGA GTTCCGGGTC
CTGGAGGACA ACGTCCGGGT CCCCTCCGGC GTGTCCTACG TGCTGGAGAA CCGGCAGGCG
GTCGCCCAGG TGCTCTCCGA GGCCGGCGCC GACCAGCTGG TGCGGCCGGT GTCGGAGTAC
CCCGGCCAGC TGCTGGCCGC GCTGCGCGCC GTCGCCCCGT GGAACGTCAC CGATCCCAAC
GTGGTCGTCC TCACCCCCGG CGTCTACAAC TCGGCCTACT TCGAGCACAC CCTGCTGGCC
CGGGAGATGG GCGTCGAGCT CGTCGAGGGA CGCGACCTGA TCTGCCGCAA CAACCGGGTC
TTCCTGCGTA CCACCTCCAG CGAGATGCCG GTACACGTCA TCTACCGCCG GATCGACGAC
GAGTTCCTGG ACCCGATGCA GTTCCGGGCC GACTCGCTGC TGGGCTCCCC CGGCCTGATC
AACGCGGCGC GGGCCGGCAA CCTGACCATC GCCAACGCGG TCGGCAACGG CATCGCCGAC
GACAAGCTGG TCTACACCTA CGTTCCGGAC ATCATTCGCT ACTACCTGAG CGAAGAGCCG
ATCCTGCAGA ACGTGGACAC CTACCGGATG GAGGTGCCCG ACCACCGCGA GTACGCCCTC
GAGCACCTGG CCGAACTGGT CCTCAAGCCG GTCGACGGAT CCGGCGGCAA GGGCATCGTC
ATCGGGTCCC GGGCGGATCG CGCGGTGCTG CGCAAGGCGC GGGAGACCAT CCTGGAGAAC
CCCCGCGGCT GGATCGCGCA GCGCGAGATC GCCCTGTCCA CGGTGCCCAC CCTGATCGGC
GAGAAGATGC GACCCCGGCA CGTGGACCTG CGGCCGTTCG CGGTCAACAA CGGGCGCAGC
GTCTGGGTGC TGCCCGGTGG CCTGACCCGG GTCGCGCTGC CCGAGGGCGA GCTGGTGGTG
AACTCCTCGC AGGGCGGCGG TTCCAAGGAC ACCTGGGTGC TCGGCGGACC GATCCCCGAG
CCCGAGCCGC AACCCGCGGC CGACGCCACC CAGGTGATGA ACATGCGCGA CCTGTCCTTC
CACCAGCCGA TCAGCCCGCC CGAGGACAAT CTCGGCTTCC GCACCCAGCA CGAACAGCAA
CAGCAACAGT CCGGCGGTGT CCGGGAACAG AACGTCCTGG CACACAGCGC CCGGGAACTG
GAGGAACCGC AGTGCTGA
 
Protein sequence
MADLFEDYPF ARAWDEMFAA PGEIRPAYES VFAALQTMDA ADLKARADIM GRTFLDQGIT 
FALGGVERPF PLDLIPRIVT AAEWQTVEKG VPQRVRALEA FLADVYGQGR IFTDGVVPKR
LVTTSPHFHR QVMGMSAQDG ARVVISGVDL IRDEKGEFRV LEDNVRVPSG VSYVLENRQA
VAQVLSEAGA DQLVRPVSEY PGQLLAALRA VAPWNVTDPN VVVLTPGVYN SAYFEHTLLA
REMGVELVEG RDLICRNNRV FLRTTSSEMP VHVIYRRIDD EFLDPMQFRA DSLLGSPGLI
NAARAGNLTI ANAVGNGIAD DKLVYTYVPD IIRYYLSEEP ILQNVDTYRM EVPDHREYAL
EHLAELVLKP VDGSGGKGIV IGSRADRAVL RKARETILEN PRGWIAQREI ALSTVPTLIG
EKMRPRHVDL RPFAVNNGRS VWVLPGGLTR VALPEGELVV NSSQGGGSKD TWVLGGPIPE
PEPQPAADAT QVMNMRDLSF HQPISPPEDN LGFRTQHEQQ QQQSGGVREQ NVLAHSAREL
EEPQC