Gene Ndas_1008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1008 
Symbol 
ID9244854 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1231574 
End bp1233196 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content70% 
IMG OID 
ProductPectate disaccharide-lyase 
Protein accessionYP_003678957 
Protein GI297559983 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAACCA GACCCGCCCT CGCGTGCGCC GCCCTACTGG CAGGCACGCT CGTCACGCTT 
TCCGGCACCG CGGCGCACGC CGCCACCACC CGCTACGAGG CCGAGGGCAC CTCGGCCTCC
TGCAACGGCA GCATCGAGAC CGAGTGGCCG GGCTACTCCG GCGACGGGTT CTGCGACACC
GAGAACGAGG AGGGCGCCCA CGTGCAGTTC AGCGTGAACG CCCCCGCCTC GGGAACGGCC
ACGCTGACCG TGCGCTTCGC CAACGGCACC TCCTCCGCCC GCCCCGCCGA CGTCCTCGTG
AACGGCTCCC GGACGGCGTC GGTCTCCTTC GAGGGCACCG GCGACTGGGA CGCGTGGACG
ACCAAGACGC TCACCGTCCA GCTGGCCGAG GGCGCCAACA CGATCCGGTT CGACCCGACC
GGTTCCCAGG GCATGCCCAA CGTCGACTAC CTCGACGTGG AGACCAGCGG CGGTGGCGAC
GACGGCGGAG GCGATGACGG CGGCGGTGAC GACGGGGGCG GCGACGACGG CAACCCGCCG
ACCTCGGACG CGTTGTACGT GGCCCCGGGC GGCAGCGCGG GCGCGGACGG CACCGAGTCC
GACCCCACCA CGCTCACCTC GGCCATCGAC CGGATCGAGC CCGGCGGGAC GATCCACATG
CGCGGCGGGA CCTACTCCTT CTCGGACACG GTCACCATCC CGCTGGGCGA GGACGGCACC
TCCGGAAACC GCACGGAGCT GTCCGCCTAC CCGGGTGAGA CCCCGGTGCT GGACTTCTCC
GCCCAGAGCG AGGACTCCGC CAACCGCGGC CTGGCGCTGG AGGCGTCCTA CTGGCACGTC
GAGGGCATCG TCGTCGAGCA CGCCGGGGAC AACGGCATCT TCGTCAGCGG CAGCCACAAC
GTCATCGAGC GCACGGTGAC CCGCTTCAAC CGCGACACCG GCCTCCAGCT CTCGCGCCGC
GTCTCCAGCA CCCCCGAGAG CGACTGGCCC GCCCACAACC TCATCCTGAG CGCGGAGTCG
CACGACAACG CCGACTCCGA CGGCGAGGAC GCCGACGGCT TCGCCGCCAA GCTCACCTCC
GGCCCCGGCA ACGTCTTCCG CTACGCCGTG GCCCACAACA ACATCGACGA CGGCTGGGAC
CTCTACACCA AGGACGACAC CGGCCCCATC GGCACCGTGA CCATCGAGGA CTCCCTGGCC
TACGAGAACG GCATCCTCAG CGACGGCTCC CAGGCCGGCA ACGGCGACCG CAACGGCTTC
AAGCTCGGCG GCGAGGACAT CGGGGTCGAC CACGTCATCA CGGGCAACAT CGCCTACGAC
AACGGCAAGC ACGGGTTCAC CTACAACAGC AACCCGGGCT CGATGACGGT GTCGGACAAC
GTCAGCATCG GCAACGAGGA GCGCAACTTC AACTTCGACG ACGGCTCCTC GGTGTTCCGC
GGCAACACCT CGTGCGACAG CGGTTCCAAC GACCGGATCA TCGGCAACGC CGACAGCTCC
AACCAGTTCT GGTCCGGCTC GAACGGGTCG CGGTGCTCCT CCTACGACGG CGGCCTGGAC
TGGTCCTACG CCGCCGACGG CACCCTGGTC GTGACCTTCG GGGGCCAGCG GGTCACGCCG
TAA
 
Protein sequence
MRTRPALACA ALLAGTLVTL SGTAAHAATT RYEAEGTSAS CNGSIETEWP GYSGDGFCDT 
ENEEGAHVQF SVNAPASGTA TLTVRFANGT SSARPADVLV NGSRTASVSF EGTGDWDAWT
TKTLTVQLAE GANTIRFDPT GSQGMPNVDY LDVETSGGGD DGGGDDGGGD DGGGDDGNPP
TSDALYVAPG GSAGADGTES DPTTLTSAID RIEPGGTIHM RGGTYSFSDT VTIPLGEDGT
SGNRTELSAY PGETPVLDFS AQSEDSANRG LALEASYWHV EGIVVEHAGD NGIFVSGSHN
VIERTVTRFN RDTGLQLSRR VSSTPESDWP AHNLILSAES HDNADSDGED ADGFAAKLTS
GPGNVFRYAV AHNNIDDGWD LYTKDDTGPI GTVTIEDSLA YENGILSDGS QAGNGDRNGF
KLGGEDIGVD HVITGNIAYD NGKHGFTYNS NPGSMTVSDN VSIGNEERNF NFDDGSSVFR
GNTSCDSGSN DRIIGNADSS NQFWSGSNGS RCSSYDGGLD WSYAADGTLV VTFGGQRVTP