Gene Namu_1406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1406 
Symbol 
ID8447002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1556163 
End bp1557293 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content72% 
IMG OID645040537 
Productglycine betaine/L-proline ABC transporter, ATPase subunit 
Protein accessionYP_003200796 
Protein GI258651640 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1125] ABC-type proline/glycine betaine transport systems, ATPase components 
TIGRFAM ID[TIGR01186] glycine betaine/L-proline transport ATP binding subunit 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.515015 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCAGCT TCGAATCGGT ATCCAAGATC TACCCGGACG GCACCCATGC CGTGGAGGAA 
CTCTCCCTGG AGATCGCGAC CGGCCGCATC ACGGTATTCG TGGGCCCGTC CGGCTGCGGC
AAGACGACCT CGCTGCGCAT GATCAACCGG ATGATCGAGC CCACCCACGG CGTGCTGTCC
ATCGACGGGC GGGACATCTC CACCGTCGAC GCCCCGGTGT TGCGGCGCGG CATCGGGTAC
GTCATCCAGA ACGCCGGCCT GTTCCCGCAC CGCACAGTGC TGGACAACGT GGCCACCGTG
CCGGTCCTGC AGGGCCGCAG CCGCCGGGAG GCCCGGCTGG CGGCGGCCGA ACTGCTGGAC
CGGGTGGGCC TGGACCGCAA CCTGGCCAAG CGCTATCCGG CCCAGCTCTC CGGCGGGCAG
CAGCAGCGGG TCGGGGTGGC CCGGGCCCTG GCCGCCGATC CGCCGGTGAT GCTGATGGAC
GAGCCGTTCT CCGCGGTCGA CCCGGTGGTC CGCAACCAGC TGCAGGACGA GCTGATCCGG
TTGCAGGCCG ATCTCGGTAA GACCATCGTG TTCGTCACCC ATGACATCGA CGAGGCGGTC
AAGCTCGGCG ACCGGATCGC GGTCTTCGCG GTCGGCGGCC GGCTGGCCCA GTACGCCGAG
CCGGCCGAGG TGCTCAGCCG GCCGGCGGAC GATTTCGTCG CCGACTTCGT CGGCCGGGAC
CGCGGTTACC GCGCCCTGTC GTTCGTCACC GGCGACCTGC CGGTCCGGCC GGAGCAGACG
CTGACCTTGG GCGCCCCCGT GCCTCGCGGG GCCGGCAGCG CCGCCGAGGG CATCTCCGCC
GATCACGGGC GCTGGATCCT GGTGGTGGAC GACGACCGCC GGCCGCGCGG CTGGCTGGAC
TGCGCGGCGG TCCCGGTCGG GCATCCGGTC GGGGTCGACG ATCTCGTGCT GGGCGGCTCG
CTGGCCGCGC CGGACGGATC CCTGCGCCGG GCCCTGGACG CGGCGCTGTC CTCCCCGTCC
GGGCGCGGCG TGGCGGTCGA TGCCGACGGC GCCGTAGTCG GCACCATCAC CGCCGCCGAG
GTGCTGACCG CGATCGAGCA GTCCCGTGGT CCGGACAAGG TCACGACATG A
 
Protein sequence
MISFESVSKI YPDGTHAVEE LSLEIATGRI TVFVGPSGCG KTTSLRMINR MIEPTHGVLS 
IDGRDISTVD APVLRRGIGY VIQNAGLFPH RTVLDNVATV PVLQGRSRRE ARLAAAELLD
RVGLDRNLAK RYPAQLSGGQ QQRVGVARAL AADPPVMLMD EPFSAVDPVV RNQLQDELIR
LQADLGKTIV FVTHDIDEAV KLGDRIAVFA VGGRLAQYAE PAEVLSRPAD DFVADFVGRD
RGYRALSFVT GDLPVRPEQT LTLGAPVPRG AGSAAEGISA DHGRWILVVD DDRRPRGWLD
CAAVPVGHPV GVDDLVLGGS LAAPDGSLRR ALDAALSSPS GRGVAVDADG AVVGTITAAE
VLTAIEQSRG PDKVTT