Gene Ndas_0957 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0957 
Symbol 
ID9244802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1172670 
End bp1173917 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content69% 
IMG OID 
ProductExtracellular ligand-binding receptor 
Protein accessionYP_003678907 
Protein GI297559933 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.408649 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAGCA AGAAGGTCCT AGGCCTCACT GCCGCAACCG CGGCCCTCGT CCTCGGCCTG 
ACCGCGTGTG GCAGCGACGG GGGAGAGGGA GGCGGCGGCG GTGAGGGCGG TGAAGAGTTC
ACCTACGGCA TCCTCTACCC GCAGACCGGC AACCTCGCCT TCCTCGGCCC GCCCCAGATC
ACGGCCGCCG AGTACGCGAT CTCCGAGATC AACGCCGCGG GCGGCATCCT CGGCACCGAG
GTCCCCGCCA TCGTCGAGGG CGACGAGGCG GGCGACAACG CCCAGGCCAA CGAGGCCGCC
AACAACCTCG TCTCCGACCA GGTGAACGCC GTCATCGGCG CCGCGGCCTC CGGCATGACC
CAGGCGACCT ACGACACCAT CACCGGTGCC GAGATCGTCC AGTGCTCGGG CTCCAACACC
GCCGCCGAGC TGAGCGAGAT CGAGGACAAC GGCTACTACT TCCGCACCGC GCCGAGCGAC
CTGCTCTCCG CGGTCGTGAT GGCCCGCACG ATGGTCGAGA ACGGCAACCA GAACATCGCC
ATCGTCGCGC GCGCCGACGA CTACGGCGGC GGCTACGCGG GCGCCCTCCA GACGGAGCTG
GAGAACCTGG GCGCCCAGGT CGTGGTCAAC GAGACCTACG ACCCGCTGGC CACCACCTTC
GACTCGGTCG TCAACAGCGT CACCACCGAG GAGCCGGACG CCGTCGCGCT CATCGCCTTC
GAGGAGGGCG CGCAGGTCAT CGCCCAGCTC CTGGAGGGCG GCACCGAGGG CGAGCAGCTC
TACGTCACCG ACGGCCTCAA CGACCCGAAC CTGGGCGAGA CCGTCAGCGC CGACAGCCCC
GAGAGCGTCA CCGGCATCAC CGGTATCGCC CCGAGCGCGG ACAACCCCGA GTTCACCGAG
GGCCTGACCA GCTTCAACGA GGAGCTGGAG GTCTTCCAGT TCGCCCCGCA GGTCTACGAC
TGCGTCACCG TGATCGCCCT GGCCGCCGAG GCCGCGGGTA GCGTGAACCC GTCCGAGTAC
GTCGCCGAGC TGCCCAACGT CAGCCGTCCC GAGGGCACCG AGTGCGGCAC CTTCGAGGAG
TGCCGCGACC TGCTGGCCGA CGGTGAGGAG ATCAACTACC AGGGCGTCAG CGGCAACATC
GACTTCAACG ACAACGGCGA CCCGACCGCC GCCACCTTCG AGATCTTCCA CTACGGGGAG
GACGGCCACG AGATCCTGGC CTACGAGGAG CACTCCCTGG AGGAGTAG
 
Protein sequence
MASKKVLGLT AATAALVLGL TACGSDGGEG GGGGEGGEEF TYGILYPQTG NLAFLGPPQI 
TAAEYAISEI NAAGGILGTE VPAIVEGDEA GDNAQANEAA NNLVSDQVNA VIGAAASGMT
QATYDTITGA EIVQCSGSNT AAELSEIEDN GYYFRTAPSD LLSAVVMART MVENGNQNIA
IVARADDYGG GYAGALQTEL ENLGAQVVVN ETYDPLATTF DSVVNSVTTE EPDAVALIAF
EEGAQVIAQL LEGGTEGEQL YVTDGLNDPN LGETVSADSP ESVTGITGIA PSADNPEFTE
GLTSFNEELE VFQFAPQVYD CVTVIALAAE AAGSVNPSEY VAELPNVSRP EGTECGTFEE
CRDLLADGEE INYQGVSGNI DFNDNGDPTA ATFEIFHYGE DGHEILAYEE HSLEE