Gene Ndas_5272 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5272 
Symbol 
ID9249170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp436467 
End bp437777 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content71% 
IMG OID 
Productseryl-tRNA synthetase 
Protein accessionYP_003683158 
Protein GI297564185 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0719173 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGGAG GATGGAGCCC GGACGGTAAC CTGCAAGACG TGATCGACCT TCGCGCTCTT 
CGAGAAGACC CCGAACGACT CCGCGCCTCG CAGCGGGCAC GGGGGGAGGA CCCCTCCGTC
GCCGACCGCC TGCTCGGGCT CGACGCCGAC CGCCGTTCCG CGCTGTCCCG GTTCGAGACC
CTGCGCGCCG AGCAGAAGAG CGTGGGCAGG TCCGTCTCCA AGGCCTCCCC GGAGGAGCGC
GAGGAACTCC TGACCCGCGC CAAGGCGCTG GCCGCCGAGG TCAAGGAGGC CGAGGCCGAG
GCCGGCCGCC TCGCCGACGA GCTGGACGCC CTCCTGTCCG GCGTGCCCAA CCTCGTGGAG
GAGGGCGCGC CCGAGGGCGG CGTGGACGAC TTCAGGATTC TGGAGACCGT CGGCACCCCG
CGCTCGTTCG ACTTCACCCC GCGCGACCAC CTGGAGCTGG GCGAGATGCT CGGCGCCATC
GACATGGAGC GGGGCGCCAA GGTCTCCGGG GCCCGGTTCT ACTTCCTCAC CGGCGTGGGC
GCCCAGCTGG AGCTGGCGCT GCTCAACATG GCCATGAACC AGGCCGTCCA GGCGGGCTTC
ACGCCGATGA TCCCGCCGGT GCTGGTGCGG CCCGAGACCA TGGAGGGCAC CGGTTTCCTC
GGCGCGCACG CCGACGAGGT CTACCACCTG CCCGCCGACG ACCTCTACCT CGTGGGCACC
TCCGAGGTGC CGCTGGCCGG GTACCACTCG GGGGAGATCC TGCCCGCCGA CGCCCTGCCC
AACCGCTACA TCGGCTGGTC GCCGTGCTTC CGCCGCGAGG CGGGCTCCTA CGGCAAGGAC
ACCCGCGGCA TCATCCGCGT CCACCAGTTC AACAAGGTGG AGATGTTCGT CTACACCCAC
CCCGACCAGG CGCACGAGGA GCACAAGAGG CTGCTCGCCT GGGAGCGGGA GATGCTGGAC
AAGCTGGAGC TGCCCTACCG CGTGGTGGAC ATCGCCGGCG GCGACCTGGG CACCAGCGCC
GCCCGCAAGT ACGACTGCGA GGCGTGGGTC CCCACCCAGG AGACCTACCG GGAGCTGACC
TCCACCTCCA ACTGCACCGA GTTCCAGGCC CGCCGCCTCA ACGTGCGGTT CCGGGACGAG
GACGGCAAGC CCCGCTTCGC CGCCACCCTG AACGGCACGC TGGCCACCAC CCGCTGGATC
GTCGCCATCC TGGAGAACCA CCAGCGCGAG GACGGCTCCG TGGTGGTCCC CGAGGCGCTG
CGCCCCTACC TGGGCCGCGA CGTCCTGGAG CCGATCGCCA AGGGCAGGTA G
 
Protein sequence
MSGGWSPDGN LQDVIDLRAL REDPERLRAS QRARGEDPSV ADRLLGLDAD RRSALSRFET 
LRAEQKSVGR SVSKASPEER EELLTRAKAL AAEVKEAEAE AGRLADELDA LLSGVPNLVE
EGAPEGGVDD FRILETVGTP RSFDFTPRDH LELGEMLGAI DMERGAKVSG ARFYFLTGVG
AQLELALLNM AMNQAVQAGF TPMIPPVLVR PETMEGTGFL GAHADEVYHL PADDLYLVGT
SEVPLAGYHS GEILPADALP NRYIGWSPCF RREAGSYGKD TRGIIRVHQF NKVEMFVYTH
PDQAHEEHKR LLAWEREMLD KLELPYRVVD IAGGDLGTSA ARKYDCEAWV PTQETYRELT
STSNCTEFQA RRLNVRFRDE DGKPRFAATL NGTLATTRWI VAILENHQRE DGSVVVPEAL
RPYLGRDVLE PIAKGR