Gene Ndas_3978 details

Gene Information

Locus tag: Ndas_3978
Symbol:
ID: 9247849
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4758448
End bp: 4759716
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 67%
IMG OID:
Product: sulfate adenylyltransferase, large subunit
Protein accession: YP_003681881
Protein GI: 297562907
COG category:
COG ID:
TIGRFAM ID:
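
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4758448
end_bp = 4759716

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # gene length in bp
print(protein_length)  # protein length in aa
```

Under those assumptions the computed values agree with the listed Gene Length (1269 bp) and Protein Length (422 aa).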


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.819909
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAACG ACATCCTGCG GTTCGCGACG GCGGGTTCCG TCGACGACGG CAAGAGCACC 
CTGATCGGCC GACTGCTGTT CGACTCCAAG TCCATCTTCG AGGACCAGCT CGACGCGGTG
GAGCGCACCA GCGTCGCGCG CGGCGAGGAG CAGACCAACC TGGCGCTGCT CACCGACGGC
CTGCGCGCCG AGCGCGAACA GGGCATCACC ATCGACGTGG CGTACCGCTA TTTCGCCACC
CCCAAGCGCA CCTTCATCAT CGCCGACACC CCCGGGCACA TCCAGTACAC CCGGAACATG
GTCACGGGCG CCTCCACGGC CGACCTCGCC ATCATCCTGG TGGACGCGCG CAAGGGCCTG
CAGGAGCAGA GCCGCCGCCA CGCCTTCCTC ACCACCCTGC TCCAGGTGCC CCACCTGGTG
CTGGCGGTCA ACAAGATGGA CCTGGTGGAC TACTCCCAGG AGCGCTTCGA GGAGATCCGG
GCCGAGTTCG CCGACTTCGC CACCAAGCTG GACGTGTGCG ACCTGACCTT CGTGCCGATC
TCGGCCCTGA ACGGCGACAA CGTGGTGAGC CGCTCGGAGA ACATGCCCTG GTACACCGGG
CCCTCCCTGC TCCACCACCT GGAGAACGTG CACATCGCCT CCGACCGCAA CCTCATCGAC
GCGCGCTTCC CCGTGCAGTA CGTGATCCGG CCGCACAGGT CGGCCGACCC CGAGCTGCAC
GACTACCGGG GTTACGCGGG CCAGATCGCG GGCGGCGTCC TCAAGCCGGG CGACGAGGTC
ACCCACCTGC CGTCCGGGCT GAACACGCGG ATCGCCAGGA TCGTCACCGC CGACGGCGAC
GTCACCGAGG CCTACTCGCC GATGTCGGTC ACCCTCCTGC TGGAGGACGA GATCGACATC
TCCCGGGGCG ACATGATCTG CCGCCCCAAC AACGCCCCCG CGGTCACCCA GGACCTGGAG
GCGATGGTGT GCTGGATGAC GGACGCGCGC AAGCTCACGC CGCGCTCCAA ACTCATCCTC
AAGCACACCA CCCGCACCGC GAAGGTCATG GTCAAGGACC TGCGCTACCG GCTGGACGTC
AACACGCTGC ACCGCGACGA GCAGGCCGAC CACCTGTCGC TCAACGAGAT CGGCCGGGTG
CGGCTGCGCT CCACCCAGCC CCTGTTCGTG GACGAGTACG CCAAGAACCG CCAGACGGGC
GGGTTCATCC TCATCGACGA GTCCACCAAC ACCACGGTCG CCGCGGGGAT GGTCGTCAAG
ACCGACTGA
 
Protein sequence
MSNDILRFAT AGSVDDGKST LIGRLLFDSK SIFEDQLDAV ERTSVARGEE QTNLALLTDG 
LRAEREQGIT IDVAYRYFAT PKRTFIIADT PGHIQYTRNM VTGASTADLA IILVDARKGL
QEQSRRHAFL TTLLQVPHLV LAVNKMDLVD YSQERFEEIR AEFADFATKL DVCDLTFVPI
SALNGDNVVS RSENMPWYTG PSLLHHLENV HIASDRNLID ARFPVQYVIR PHRSADPELH
DYRGYAGQIA GGVLKPGDEV THLPSGLNTR IARIVTADGD VTEAYSPMSV TLLLEDEIDI
SRGDMICRPN NAPAVTQDLE AMVCWMTDAR KLTPRSKLIL KHTTRTAKVM VKDLRYRLDV
NTLHRDEQAD HLSLNEIGRV RLRSTQPLFV DEYAKNRQTG GFILIDESTN TTVAAGMVVK
TD
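
The sequences above are printed in blocks of 10 characters, so the whitespace has to be removed before any computation. The following is a minimal sketch, not the method used to build this record, of how the gene sequence could be checked against the GC content and protein sequence fields. It assumes that translation table 11 assigns amino acids to codons in the same way as the standard genetic code (the two tables differ in which start codons they permit). The names `clean`, `gc_fraction`, and `translate` are hypothetical helpers, and only the first 30 bases of the gene are used as sample input.

```python
# Codon-to-amino-acid assignments in NCBI base order (T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied with their display spacing.
sample = "ATGAGTAACG ACATCCTGCG GTTCGCGACG"
print(translate(sample))
print(round(gc_fraction(sample), 3))
```

For the sample input, the translation is MSNDILRFAT, which matches the first ten residues of the protein sequence. Pasting the full gene sequence into `sample` would allow the same comparison against the complete protein sequence and the listed GC content.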