Gene Namu_2949 details


Gene Information

Locus tag: Namu_2949
Symbol:
ID: 8448562
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 3229794
End bp: 3231296
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 69%
IMG OID: 645042034
Product: Na+/solute symporter
Protein accession: YP_003202276
Protein GI: 258653120
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID:
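
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(3229794, 3231296)
print(length, protein_length(length))  # 1503 500
```

Both values match the Gene Length and Protein Length fields of the record.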


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.0128221
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000189844
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGATCTCG TCATCATCGC CGTCTATATC GGCGCCATGA TCGGCATCGG CTTCTACGCC 
AAGGGCAAGG CCACCAGCGA GTCGGACTTC CTGGTCGCCG GACGCCGGCT CGGCCCGATG
CTCTACGCCG GCACGATGGC CGCCGTGGTC ATCGGCGGCG CCTCCACCAT CGGTGGCGTC
GGCCTGGGCT ACGAGTACGG CCTGTCCGGA CTCTGGCTGG TCTTCTCCAT CGGCTGCGAC
ATCCTGTTCC TGAGCCTGGT CTTCGCCGGC CGGATCAACC GGGCCGGCGT CTACACGGTC
AGCCAGCTGC TCGAACTGCG ATACGGGCGC GGCGCCAGCC TGCTGTCCGG GATCGTCATG
TGGGCCTACA CCCTCATGCT GGCGGTTACC TCCACCATCG CCTACGGCAC CGTCTTCGGC
GTGCTGTTCG ACATCGGCAA GGTCCCGTCG ATCATCATCG GCGGCGCCGT CGTGGTCACC
TACTCGGTGC TCGGCGGCAT GTGGTCGATC ACGCTCACCG ACTTCGTGCA GTTCATCATC
AAGACCATCG GCATCTTCGT CCTGCTGCTG CCGGCCGCCC TGATCAAGGC CGGCGGCTTT
GCCGGGCTGG CCGAGAAGCT GCCGGACACG GCGTTCTCGT TGACCACCAT CGGCGGCGGC
ACGATCGTGA CCTACTTCGT CATCTACTTC TTCGGCCTGG TCATCGGCCA GGACATCTGG
CAGCGGGTCT TCACCGCCCG CAGCGACAGC GTCGCCAAAT GGGCCGGCAC CGCCTCCGGC
GTCTACTGCC TGCTCTACGC GGTGGCCGGG GCGCTGATCG GGATGAGCGC CAAGGCGATC
CTGCCGAACC TGGCCGAGCG GGACGACGCC TATCCCGCGG TCGTCCAGGC CGTGCTGCCG
GTCGGCGTGG CCGGCCTGGT CATCGCCGCC GCCCTGGCCG CCATCATGTC CACCTCCAGC
GGCGCCCTGA TCGCCACCGC GACGGTGGCC AAGGAGGACA TCGTGGCCTC CTTCCGGCGG
CGCCGGCACC CGGCCGCCAC CGGCGCCTCG GCCGGCACCG CCGACGAGTC CGCGACGCCC
GCCCAGGAGC ACGACGAGGT CTCCGGCAGC CGTTGGTACA TCTTTGGTTT CGGCCTGCTG
ACGATCGGCA TGGCCTGCGT CATCAGCGAC GTCATCGCCG CGCTGACCAT CGCCTACGAC
ATCCTCGTCG GCGGCCTGCT GGTGGCCATC CTGGGCGGCC TGATCTGGAA GCGCGGCACC
ATCACCGGCG CGCTGGCCTC CATCGGCGCC GGCGTGGTGG TCACCCTGGG CACCATGGCG
TACTACGGCG ACATCTACGC CAACGAGCCG ATCTTCGCCG GCCTGATCGT CAGCCTGCTC
GCCTATGTGG TGGTCAGCCT GATCACCCCG GCCACCCCGG AACCGATCCG GGCCGAATGG
GACCGCCGGG TCAACCGCAA GCGCACCGCC GCCACCTCCC CCGCCGCGAC CCAGCAGGCC
TGA
 
Protein sequence
MDLVIIAVYI GAMIGIGFYA KGKATSESDF LVAGRRLGPM LYAGTMAAVV IGGASTIGGV 
GLGYEYGLSG LWLVFSIGCD ILFLSLVFAG RINRAGVYTV SQLLELRYGR GASLLSGIVM
WAYTLMLAVT STIAYGTVFG VLFDIGKVPS IIIGGAVVVT YSVLGGMWSI TLTDFVQFII
KTIGIFVLLL PAALIKAGGF AGLAEKLPDT AFSLTTIGGG TIVTYFVIYF FGLVIGQDIW
QRVFTARSDS VAKWAGTASG VYCLLYAVAG ALIGMSAKAI LPNLAERDDA YPAVVQAVLP
VGVAGLVIAA ALAAIMSTSS GALIATATVA KEDIVASFRR RRHPAATGAS AGTADESATP
AQEHDEVSGS RWYIFGFGLL TIGMACVISD VIAALTIAYD ILVGGLLVAI LGGLIWKRGT
ITGALASIGA GVVVTLGTMA YYGDIYANEP IFAGLIVSLL AYVVVSLITP ATPEPIRAEW
DRRVNRKRTA ATSPAATQQA