Gene Ndas_5144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5144 
Symbol 
ID9249037 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp287399 
End bp288712 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content63% 
IMG OID 
Productpreprotein translocase, SecY subunit 
Protein accessionYP_003683030 
Protein GI297564057 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.552257 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTAGGTG CGTTCGTTCG TGCATTCCGC ACGCCCGACC TGCGCAACAA GCTGCTGTTC 
ACACTGTTCA TCCTGACGAT CTTCCGTCTG GGCTCGGTGA TCCCGGCGCC GGGTATCAAC
TCCGCGGCGA TCAGGGACCA GATGGAGGCG ATCACCTCCT CGGACCAGTC CGGTGTGTAC
GCGCTCGTCA ACCTGTTCAG CGGCGGTGCC CTGCTGCAAC TGGCGGTGTT CGCGCTCGGG
GTCATGCCGT ACATCACCGC GAGCATCATC ATCAACCTGC TCACGGTGGT GATCCCCCGT
CTGGAGGCCC TCAAGAAGGA GGGCCAGTCC GGACAGAGCA AGATCACCCA GTACACCCGC
TACCTGACGC TCATGCTCGG TGTCCTCCAG GCCACCAGCA TCGTCGCCAT GGCCCGGACG
GGCGCCCTGT TCCAGGGGGC CATCCCGGTC AGCTCCTACA TGCCCAACCA GGACATCCTC
ACCCTGGTGA CCATCGTCTT CACGATGACG GCCGGTACCG CCATCATCAT GTGGTTCGGC
GAGCTCATCA CCGAGCGCGG CGTGGGCAAC GGCATGTCGC TGCTGATCTT CACCCAGGTC
ATCGCGATGT TCCCCTCCTC CATGGTCGCG CTGTTCCAGG AGCGCTCGAT CTGGGTCTTC
TCCCTCATCT GCATCGCCGG CCTGGTGCTC ATCACCGCGG TGGTCTTCAT GGAGCAGGCC
CAGCGCCGCA TCCCGGTGCA GTACGCCAAG CGCATGGTCG GCCGCCGCAT GTACGGCGGC
AGCTCGACCT ACATCCCGCT GAAGGTGAAC CAGGCGGGCA TCATCCCCGT GATCTTCGCC
TCCTCGCTGC TGTACCTGCC GCAGCTGCTC GTGGGACTCA TGGGACAGGA CTCCACCCAT
CCCGTGGTGA CCTTCGTGCA GGACTACTTC CTCACCGGCA CCCACCCCGT GTACATGGCG
ACGTTCTTCG TCATGATCAT CGGCTTCGCG TTCTTCTACG TGGCGATTAC CTTCAACCCC
GCTGAGGTCG CCGACAACAT GAAGAAGTAC GGTGGGTTCA TCCCGGGTAT CCGACCAGGG
CGTCCGACCG CCGAGTACCT CGACTACGTG CTGACGAGGC TGACGACCCC CGGATCCCTG
TACCTGGGTG TGATCGCTCT CCTGCCGATG GTCGCCCTCG GCGCCACCGG CGCCAGTGCG
AACTTCCCCT TCGGAGGGAC GAGCATCCTG ATCATGGTCG GCGTCGGGCT GGACACGGTG
AAGCAGATCG AGAGTCACCT CCAGCAGAGG AACTACGAAG GTTTTCTGCG ATAA
 
Protein sequence
MLGAFVRAFR TPDLRNKLLF TLFILTIFRL GSVIPAPGIN SAAIRDQMEA ITSSDQSGVY 
ALVNLFSGGA LLQLAVFALG VMPYITASII INLLTVVIPR LEALKKEGQS GQSKITQYTR
YLTLMLGVLQ ATSIVAMART GALFQGAIPV SSYMPNQDIL TLVTIVFTMT AGTAIIMWFG
ELITERGVGN GMSLLIFTQV IAMFPSSMVA LFQERSIWVF SLICIAGLVL ITAVVFMEQA
QRRIPVQYAK RMVGRRMYGG SSTYIPLKVN QAGIIPVIFA SSLLYLPQLL VGLMGQDSTH
PVVTFVQDYF LTGTHPVYMA TFFVMIIGFA FFYVAITFNP AEVADNMKKY GGFIPGIRPG
RPTAEYLDYV LTRLTTPGSL YLGVIALLPM VALGATGASA NFPFGGTSIL IMVGVGLDTV
KQIESHLQQR NYEGFLR