Gene Ndas_1630 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1630 
Symbol 
ID9245480 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1998970 
End bp2000130 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content67% 
IMG OID 
ProductCystathionine beta-lyase 
Protein accessionYP_003679565 
Protein GI297560591 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACAGGT CCGACAGGGA AATCTGCATG AGCGCGTGGG AGGGGGCGGA GAACGTCCAC 
GGCGCGGTCG CCCCTCCGGT TTTCCAGACC AGCATTTTCA CCAAACCCTC GTTCGAGGCG
TTCATCCAGG AGCAGGAACA GGAACACGAG AGGTACGTCT ACAGCCGCGG CGCCAATCCC
ACGGTGGCCT TTCTGGAGGA GAGGCTGGCC CTTCTGGAGC GCGGCGAGGC CTGTAAGTGC
TTCGGTTCCG GAATGGGGGC CATCAGCGCC GTCCTGATGA GCCTGCTGCG CGGCGGGGAC
CACATCCTCT TCGTCAACAG CACCTACGGC CCGGCCCTGG AGATGGCCGA GCACCTGCGC
GGTTTCGGCA TCGACCACAC CGTGCTGCCC GACGGCACGT CCGACATCGA GCCCCACCTG
CGCAAGAACA CGGCGCTGGT CTACGTCGAG AGCCCCGGGA CCATGCGGAT GAAGGTCCTG
GACCTGGCCG CGATCACCCG GACCGCACGG GCCAGGGGCG TCTGGACCGT GATGGACAAC
ACCTGGTCCA CGCCGCTCTT CCAGAAGCCG ATCCTGGCCG GGGTGGACAT CGTCATCCAC
TCGTGCACCA AGTACATCGG CGGTCACAGC GACGTCCTGG GCGGGGCGGT GATCGGCCCG
GCCTCCTTCG TGCGCGACCT CTTCTACACG GGGTTCCAGC TCCTGGGTTC GGTCATGTCG
GCCGTCGAGG CGTCGATGGT GCTGCGCGGG CTGCGGACGC TGCCGATCAG GATGGCCGAG
CACGAGCGCA GCGCCGTGCG GGTCATCGAC TACCTGGCGA CCCGGCCCGA GGTGGCGGCG
ATCCACCACC CCCACCACGA CCACCGGCCC GACGACCCCC TGGTCAAGGA CCAGTTCAGC
GGTTTCTCCG GGCTGCTCAG TTTCGACCTG AAGGACGGTT CCTTCGAGAA GGTCGCGGCC
TTCATCAACC GTCTTTCGCT GTTCCGGATC GGCGCGAGCT GGGGCGGTTA CGAAAGCCTG
GTCACCGCCC CCGTCCGGCC CGGAAACGAG GGGGCGTTGC GGGAGAGGGG ATTCTCCCCC
GGAATGGTCC GCCTCTCCGT GGGCCTGGAG GGGGCCGACA GCCAGATCGA GGATCTCGAA
AGGGCCTTCA CCGCACTGTA G
 
Protein sequence
MDRSDREICM SAWEGAENVH GAVAPPVFQT SIFTKPSFEA FIQEQEQEHE RYVYSRGANP 
TVAFLEERLA LLERGEACKC FGSGMGAISA VLMSLLRGGD HILFVNSTYG PALEMAEHLR
GFGIDHTVLP DGTSDIEPHL RKNTALVYVE SPGTMRMKVL DLAAITRTAR ARGVWTVMDN
TWSTPLFQKP ILAGVDIVIH SCTKYIGGHS DVLGGAVIGP ASFVRDLFYT GFQLLGSVMS
AVEASMVLRG LRTLPIRMAE HERSAVRVID YLATRPEVAA IHHPHHDHRP DDPLVKDQFS
GFSGLLSFDL KDGSFEKVAA FINRLSLFRI GASWGGYESL VTAPVRPGNE GALRERGFSP
GMVRLSVGLE GADSQIEDLE RAFTAL