Gene Ndas_0397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0397 
Symbol 
ID9244235 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp488252 
End bp489361 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content74% 
IMG OID 
Productglycine betaine/L-proline ABC transporter, ATPase subunit 
Protein accessionYP_003678351 
Protein GI297559377 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.742277 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGCGT TCGAGGGCGC CGCCAAGCTC TATCCCGACG GCACGGTGGC CGTCGACCAG 
CTGGACCTGA CCGTCGAGAC CGGCCAGACG ACGGTCTTCG TGGGTCCCTC CGGCAGCGGC
AAGACCACGT CGCTGCGGAT GATCAACCGC ATGGTCGAAC CGACCGGGGG CACCGTCCGC
ATCGACGGCG AGGACGTGCG CGAGCGCGAC CCCGCCGCGC TGCGCCGCTC CATCGGCTAC
GTCATCCAGC AGGCCGGGCT CTTCCCGCAC CGCACCGTGC GCGACAACAT CGCCACCGTG
CCCCTGCTGC TCGGCTGGGG CCGGGCCAGG GCCAGGGCGC GCGCCGCGGA GCTGATGGAA
CTGGTAGGTC TGGAGCCCGC CCAGGCCAGG CGCTACCCCC ACCAGCTCTC CGGGGGGCAG
CAGCAGCGCG TCGGCGTCGC CCGCGCGCTG GCCGCCGACC CGCCCATCCT GCTGATGGAC
GAACCCTTCA GCGCCGTGGA CCCCGTCGTG CGCGCCAGCC TCCAGGACGA GCTCCTGCGC
CTGCAGAAGG AGCTGCACAA GACCATCGTC TTCGTCACCC ACGACATCGA CGAGGCCGTC
CGGCTCGGTG ATCGCATCGC CGTCTTCCGC CCCGGCGGGA GGCTCGCCCA GTACGACACG
CCCCAGAACC TGCTGGCCGC GCCCCAGGAC GCCTTCGTGG AGTCCTTCAT CGGCTACGAC
CGGGGAGTGC GGCGCCTGTC CTTCTTCCCG GCCGACAGGC TCTCCCCGCG CCAGGACGCC
GTCCTGGAGG AGAGCGTGCG CGCGGGCGCC GCGGTCGCCC CGCTCGGGAA CGAGCCCTGG
GCCCTGGTCG TCAGCGGCGA CCGCATGCCC CTGGGCTGGG TCAGCGCGCG GCAACTGGCC
GACGCGCCCG CCGACACCGC CCTGGGCTCA CTCGAACTCG CGCCCTTCGG CCACACCTTC
GACGTGGGCA CCGACTCCCT GCGCGCCGCC CTGGACGCGG CGGTGCTCTC GCCCGCGGGC
CGCGCGGTCG GCGTCGACGC CGACGGTCGG GTGGTCGGCG TGGTCTCGCA GGACGACCTG
GGCGCCGCTC TGTGGTCGGT GACCGAGTGA
 
Protein sequence
MIAFEGAAKL YPDGTVAVDQ LDLTVETGQT TVFVGPSGSG KTTSLRMINR MVEPTGGTVR 
IDGEDVRERD PAALRRSIGY VIQQAGLFPH RTVRDNIATV PLLLGWGRAR ARARAAELME
LVGLEPAQAR RYPHQLSGGQ QQRVGVARAL AADPPILLMD EPFSAVDPVV RASLQDELLR
LQKELHKTIV FVTHDIDEAV RLGDRIAVFR PGGRLAQYDT PQNLLAAPQD AFVESFIGYD
RGVRRLSFFP ADRLSPRQDA VLEESVRAGA AVAPLGNEPW ALVVSGDRMP LGWVSARQLA
DAPADTALGS LELAPFGHTF DVGTDSLRAA LDAAVLSPAG RAVGVDADGR VVGVVSQDDL
GAALWSVTE