Gene Namu_0503 details

Gene Information

Locus tag: Namu_0503
Symbol:
ID: 8446086
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 557192
End bp: 558442
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 70%
IMG OID: 645039639
Product: protein of unknown function UPF0118
Protein accession: YP_003199911
Protein GI: 258650755
COG category: [R] General function prediction only
COG ID: [COG0628] Predicted permease
TIGRFAM ID:
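
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon while the protein length does not; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(557192, 558442)
print(length)                     # 1251
print(protein_length_aa(length))  # 416
```

Both results match the Gene Length and Protein Length fields reported for Namu_0503.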


Plasmid Coverage information

Num covering plasmid clones: 45
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCGGAC GACGCTCACC GGGAGCAGGC CGGGCCGTCG CCGCCGATCC CGACTCCCGG 
ACCGGTGCCG ATGGCGGTAC CGATGCCGAC GACCTGGCCG GTCATCGCCC CACCGGACCG
GTCAGCACCG CCTCCGCGCG CCGGTTGGAA CGCGGGCTCG TGGAGTACTC GCAGTGGACG
CTGCGCCTGC TGATCATCGG CGTGGGGTTG TTCGCCGCGT TCTGGATCCT GGGCCAGCTC
TGGAGTGTGG TCCTGCCGAT CCTGCTCGGC CTGCTGCTGG CCACCATCCT GTGGCCGCCG
GTCCGGTTCA TGCGGCGCAA GCTGCCCAAT GCCCTGGCCG CGATCGTGGC CCTGATCGGC
CTGCTGGTGA TCTTCAGCGG GTTGATCGCG GTGCTGGCTC CCCAGGTGAC CTCGCAGGCC
GGCGAGCTGG TCGATCGGGC CACCGCCGGC CTGACCACCC TGCAGAGCTG GCTGGCCGGT
CCCCCGTTCA ACCTGGGCCC GGACGCGCTG GGCGGGCTGC TGGACAAGGG CATTTCCGAG
ATCCAGAGCA ACAGCCAGGA AGTGCTCGGG GTGGTGCTGG GCAGCCTGTC CGCCATCGGC
TCGGCCGTGA TCACCCTCGT CCTGGCCCTG GTGCTGTGCT TCTTCTTCCT CAAGGACGGC
CCCAAGTTCG TGCCGTGGCT GCGGACCTGG ATCGGTCGCG CCGCCGGCAC CCACTTCGCC
GAGCTGTCCG ACCGGGTGTG GACCGCCCTG GGCCAGTACG TCTGGTCGCA GGCGGCGGTC
GCCGCGGTCG ACGGCGTCTT CATCGGCGTC GGAGTGTGGT TGCTCGGCGT GCCGTTCGCC
CTGCCCATTG CGGTGCTCAC CTTCTTCGGC GGGTTCGTGC CGATCGTCGG TGCGTTCGTC
GCCGGGTCGG TCGCCGTCCT GGTCGCGCTG GTCTCCAACG GCATCTGGAC CGCCGTGGGC
GTGCTGGCGA TCGTGCTGGT CGTCCAGCAG CTCGAGGGCA ACGTGATGCA GCCGATCCTG
GTCGGCAAGA CGATGAACAT CCACGCCGCG GTGACCATCG CCGTGGTCGC CCTGGGCGGC
ACCCTGTTCG GCATCGTCGG GGCGTTCCTG GCCGTCCCGG CGGTGGCCGT GGTTCAGGTG
ATCGCCCGGT ACACCCGCGA GCAGCTGCAG GAGGCGCCCG ATTCCGCGCG CGATTCCGCG
CCCAATTCCG CGCCCAATTC CGGACCCACG CCCGATCCCG ACCCGGCCTG A
 
Protein sequence
MFGRRSPGAG RAVAADPDSR TGADGGTDAD DLAGHRPTGP VSTASARRLE RGLVEYSQWT 
LRLLIIGVGL FAAFWILGQL WSVVLPILLG LLLATILWPP VRFMRRKLPN ALAAIVALIG
LLVIFSGLIA VLAPQVTSQA GELVDRATAG LTTLQSWLAG PPFNLGPDAL GGLLDKGISE
IQSNSQEVLG VVLGSLSAIG SAVITLVLAL VLCFFFLKDG PKFVPWLRTW IGRAAGTHFA
ELSDRVWTAL GQYVWSQAAV AAVDGVFIGV GVWLLGVPFA LPIAVLTFFG GFVPIVGAFV
AGSVAVLVAL VSNGIWTAVG VLAIVLVVQQ LEGNVMQPIL VGKTMNIHAA VTIAVVALGG
TLFGIVGAFL AVPAVAVVQV IARYTREQLQ EAPDSARDSA PNSAPNSGPT PDPDPA
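
The sequences above are printed in blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch, not the database's own tooling, of how the GC content and the translation could be rechecked from the gene sequence. It assumes only the Python standard library; the helper names are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in which codons may act as starts), so the standard codon layout is used here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 0 up to the first stop codon; ignore a trailing partial codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record above.
first_line = clean_sequence(
    "ATGTTCGGAC GACGCTCACC GGGAGCAGGC CGGGCCGTCG CCGCCGATCC CGACTCCCGG"
)
print(translate(first_line))  # MFGRRSPGAGRAVAADPDSR
print(round(gc_fraction(first_line), 2))
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence through the same functions would allow the reported 416 aa length and 70% GC content to be checked.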