Gene Ndas_3174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3174 
Symbol 
ID9247031 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3799957 
End bp3801174 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content72% 
IMG OID 
Producttryptophan synthase, beta subunit 
Protein accessionYP_003681088 
Protein GI297562114 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.546685 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCACG ACCAACCCCT GCCCGACCAG CTCGGGCACT ACGGCCGCTT CGGCGGCCGC 
TTCGCGCCCG AAGCCCTCGT CGCCGCCCTC GACGAGGTGG CCGCGGAGTG GGAGAAGGCC
AAGCAGGACC CCCAGTACCA GGCCGAACTC GCCGAACTGC TCAAGGACTA CACCGGGCGC
CCCAGCGCCC TGAGCGAGGC CCGCAACTTC TCCGAGCACT GCGGCGGCGC GCGGATCCTG
CTCAAGCGCG AGGACCTCAA CCACACCGGA TCCCACAAGA TCAACAACGT CCTCGGCCAG
GCCCTGCTCA CCAGGCGCAT GGGCAAGACC CGCGTCATCG CCGAGACCGG AGCCGGACAG
CACGGCGTGG CCACCGCCAC AGCCTGCGCC CTGCTCGGCC TGGAGTGCGT GATCTACATG
GGCGAGGAGG ACACCCGCCG CCAGGCGCTC AACGTCGCCC GCATGCGCAT GCTCGGCGCC
GAGGTCGTCC CCGTCACCAT CGGCAGCCGC ACCCTCAAGG ACGCCATCAA CGAGGCCTTC
CGCGACTGGG TGGCCAACGT CGACCGCACC CACTACCTGT TCGGCACCGT CGCCGGACCG
CACCCCTTCC CCAAGCTCGT GCGCGACCTG CACTTCGTCG TCGGCCAGGA GGCCCGCGAA
CAGGTCCTGG AGCGCGTCGG CAGGCTCCCC GACGCGGTCG CCGCGTGCGT GGGCGGCGGC
TCCAACGCCA TGGCCGTCTT CGCGGCCTTC ATCCCCGACG AGGAGGTCGC CCTGTACGGC
TTCGAGGCCG GTGGGGAGGG GGCGCGGACC ACCCGCACCG CCGCCTCCAT CACGGCGGGC
AGCCCCGGCG TCTTCCACGG GGCGCGCACC TTCGTGCTCC AGGACGAGTA CGGCCAGACC
CTGCCCAGCC ACTCCATCTC CGCCGGACTC GACTACCCGG CGGTGGGCCC CGAGCACGCC
TACCTCGCCG ACACCGGCCG CGCCACCTAC GAGCCGGTCA CCGACGCCGA GGCGATGGAG
GCCTTCCGGC TGCTGTGCCG CACCGAGGGC ATCATCCCCG CCATCGAGAG CGCGCACGCC
CTGGCCGGCG CCCGCAAGCT CGGCGAGCGC CTCGGCCCGG ACGCCGTCAT CCTGGTGAAC
CTCTCCGGGC GCGGCGACAA GGACGTTGAC ACCGCGGCCG CCTACTTCGG CCTCGTCGAC
CCGGAGGGAC AGGCGTGA
 
Protein sequence
MSHDQPLPDQ LGHYGRFGGR FAPEALVAAL DEVAAEWEKA KQDPQYQAEL AELLKDYTGR 
PSALSEARNF SEHCGGARIL LKREDLNHTG SHKINNVLGQ ALLTRRMGKT RVIAETGAGQ
HGVATATACA LLGLECVIYM GEEDTRRQAL NVARMRMLGA EVVPVTIGSR TLKDAINEAF
RDWVANVDRT HYLFGTVAGP HPFPKLVRDL HFVVGQEARE QVLERVGRLP DAVAACVGGG
SNAMAVFAAF IPDEEVALYG FEAGGEGART TRTAASITAG SPGVFHGART FVLQDEYGQT
LPSHSISAGL DYPAVGPEHA YLADTGRATY EPVTDAEAME AFRLLCRTEG IIPAIESAHA
LAGARKLGER LGPDAVILVN LSGRGDKDVD TAAAYFGLVD PEGQA