Gene Ndas_1552 details


Gene Information

Locus tag: Ndas_1552
Symbol:
ID: 9245402
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1902056
End bp: 1903105
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: binding-protein-dependent transport systems inner membrane component
Protein accession: YP_003679487
Protein GI: 297560513
COG category:
COG ID:
TIGRFAM ID:
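
The coordinate and length fields in this record agree with each other if the coordinates are read as 1-based and inclusive, and if the final codon is a stop codon that is not counted in the protein. A minimal Python sketch of that arithmetic, assuming those conventions (the function names are illustrative and not part of the source database):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1902056, 1903105)
print(length, protein_length(length))  # 1050 349
```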


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.233578
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.142881
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCGC CCTCGCATGC CCCCGATTCG GCGGAGCAGG CGGAGCAGAA CGACGCCGTG 
AACGTCGCTG CGCAGGCCCA CGGGGTCCGC GGAGACTCCG ACAACCGGTC CCTGCGCCAG
ATCGCCTGGC AGCGGCTGCG CAAGGACAAG GTCGCGATGG TCTCCGGCGT GGTCGTGGTC
CTGCTCATCC TCGCCGCGAT CCTCGCGCCC CTCCTGGCCA AGTGGTTCGG GCATCCGCCC
ACCCAGTTCC ACCAGGACCT GATCGAGCCC GGCACCGGCC TGCCGGCCAA CGACCCGGCC
AACCCGAGCC CGTTCGACAC CGACCCCTGG GGCGGTATCA GCGCCGACCA CCTGCTCGGC
GTGGAGCCGG TGACCGGACG CGACCTGTTC AGCCGCATCC TCTACGGCGC CCAGATCTCC
CTGCTGGTGG CCTTCCTGTC CACGCTGCTG TGCGTGTTCA TCGGCACTGT CCTGGGCATC
GTCGCCGGGT ACAAGGGCGG CTGGGTCGAC ACCCTCATCA GCCGGGCCAT GGACATCTTC
CTGGCCTTCC CGCTGATGCT CTTCGCCATC GCGCTCGTGG GCGTCATCCC CGACGGCGTC
CTGGGCCTGA GCGGCAACGG CCTGCGCATC GGCGTCATCG TCTTCATCAT CGGCTTCTTC
AACTGGCCCT ACATCGCGCG CATCGTCCGG GGGCAGACGC TCTCGCTGCG CGAGCGGGAG
TTCGTGGAGG CCGCCAGGAG CCTGGGCGCC AGCAACCGGC ACATCCTCTT CCGGGAGATC
CTGCCCAACC TGGTCACGCC GATCATCGTC TACTCGACCC TGCTCGTCCC CACGAACATC
CTGTTCGAGG CGGCCCTGAG CTTCCTGGGC GTCGGTATCA ACCCGCCCAC GCCGAGCTGG
GGCAAGATGC TCTCCGACGC GGTGCCGCTG TACGAGAAGG CGCCCTACTT CGTGGTCTTC
CCGGGTCTGG CCATCTTCAT CACCGTCCTG GCGTTCAACC TGTTCGGCGA CGGGCTGCGC
GACGCCTTCG ACCCCAAGAC CTCCGACTGA
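
The GC content reported in the Gene Information table can in principle be recomputed from the gene sequence by counting G and C bases. The following is a hedged sketch, assuming the sequence is pasted as shown here (10-base blocks separated by spaces and line breaks); `gc_content` is a hypothetical helper name, and the worked example uses only the first block of the sequence:

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence only, as a small worked example.
print(gc_content("ATGAGCGCGC"))  # 70.0
```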
 
Protein sequence
MSAPSHAPDS AEQAEQNDAV NVAAQAHGVR GDSDNRSLRQ IAWQRLRKDK VAMVSGVVVV 
LLILAAILAP LLAKWFGHPP TQFHQDLIEP GTGLPANDPA NPSPFDTDPW GGISADHLLG
VEPVTGRDLF SRILYGAQIS LLVAFLSTLL CVFIGTVLGI VAGYKGGWVD TLISRAMDIF
LAFPLMLFAI ALVGVIPDGV LGLSGNGLRI GVIVFIIGFF NWPYIARIVR GQTLSLRERE
FVEAARSLGA SNRHILFREI LPNLVTPIIV YSTLLVPTNI LFEAALSFLG VGINPPTPSW
GKMLSDAVPL YEKAPYFVVF PGLAIFITVL AFNLFGDGLR DAFDPKTSD
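
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares; table 11 differs in the start codons it permits, and this sketch does not handle alternative start codons. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI layout); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGAGCGCGC CCTCGCATGC CCCCGATTCG"))  # MSAPSHAPDS
```

Applied to the last line of the gene sequence, the same function yields the final residues of the protein and stops at the terminal TGA codon.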