Gene Ndas_3626 details


Gene Information

Locus tag: Ndas_3626
Symbol:
ID: 9247495
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4349184
End bp: 4350869
Gene Length: 1686 bp
Protein Length: 561 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: hydrolase CocE/NonD family protein
Protein accession: YP_003681532
Protein GI: 297562558
COG category:
COG ID:
TIGRFAM ID:
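
The length fields above follow from the coordinates by simple arithmetic. The sketch below shows that relationship, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
START_BP = 4349184
END_BP = 4350869


def gene_length(start_bp: int, end_bp: int) -> int:
    """Number of bases covered by an inclusive coordinate range."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded, assuming one trailing stop codon."""
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(f"Gene length: {length_bp} bp, protein length: {length_aa} aa")
```

Under these assumptions the computed values agree with the Gene Length (1686 bp) and Protein Length (561 aa) fields of the record.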


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCCCTG CCAGAACCCC CTTCCGCCCC GCGCCCTGCG AGGACGGACC GCCCCGGACC 
GGCTCCGGGC TCCGCGCCGC CGCGCGACTC CTCCCCCGCC TGCTGTCCGT CGCCCTGGCG
GCCGTGCTCG CCGCCGCGGT CCTCTCCCCC GCCCCGGCCT CGGCCGCGGT CGGCGCCAGC
ATCGGCTACG AGACCATCGT CAGCCACGAC GGCACCGAGC TCAGGGCCAA GGTCATCACC
CCCACCGGGG TCGAGGGCCC GCACCCGCTG CTGGTCATGC CCTCCGCCTG GGGCACGCCC
CACCTCCTCT ACGTGGGCGC CGCCGCCAGG CTGGCCCACG AGTCCGGCTA CCAGGTCGTG
GCCTACACCT CGCGCGGCTT CTGGGACTCG GGCGGCGGGA TCGGGGTGGC CGGTCCCGAG
GACCGCGCGG ACGCCAGCGC CGTCATCGAC TGGGCGCTGG AGAACACCGA CGCCGACCCC
GACCGCATCG GCATGGCCGG GATCTCCTAC GGGGGCGGCA TCAGCCTGCT GACCGCCGCC
GAGGACGACC GGATCCGCGC CGTGGCCTCG CTGAGCGGCT GGGCCGACCT GGCCGTCTCC
CTGTACCCCA ACGAGACCGT GGACACGCAG TCGGCCGAGC TGCTCCTGCT GGCGGGCCAC
CTCACCGGCA CCCCCGGCGA GGAGCTGGCC GGGATCGAGG CCGCCTACCG GCGCGGCGAC
ATCCAGCCCG CCCTGGACGT GGCCCCGGAC CGGTCGGCGG CCACCAAGGT GGACGCCATC
AACGCCAACG GCACGGCCGT GATGATGGCG CACGCCTGGA ACGACGGCAT CTTCCCCGTG
GGCCACATCA CCGACTTCTA CGAGGAGCTG GAGGTGCCCA AGCGGCTCAT GATCACCCCG
GGCGACCACG CCACGCAGGA GCTGTTCGGC GCGGCCGGGC TGCCCAACGA GGTGTGGGAG
GCCCTGGGCG ACTGGTTCGA CCACCACCTG CGCGGCCAGG CCAACGAGGT GGACGCCGAG
GGGCCCGTGC ACGTCAGGCC CAACAACGGG CGCGGCGGCT GGACCACGCA CCCCGACTGG
CCGTCGGTCA CCGCGAGCAC CGACCGGTAC TACCTGGCCG AGCCGAGCAC GGACTGGTTC
CGCTGGCAGA GCAGCGGCGC CATGGAACCC GAGCCCTCCA CCGGATGGGA CTACCGCTTC
CGCACGGGGA TCGCCACCCC GGCCGAGAGC GGCACGGTGA TGCTCTCCGG CGCCCTCCAG
CAGTTCTTCG ACATCCCGAC CGGGGTGCTG CTGCCGACGG TGGACCGCTA CCGGGCCGCC
GTGTGGACCA GCCCCGCCTA CCCCGAGGGG GCGCGGGTGG CGGGGACGCC GGAGGTCTCC
CTCACGTTCA CGCCGACCGC TCCGGAGCAG TCGGTGTACG TGTACCTGTA CGCGATCGAC
GGGCGCGGAA CGGGTTCCCT GCTGTCGCAC GCGCCCCACA CCCTGCGCGG AGCCACTCCC
GGTGAACCGG TGACGGTGGA CACGGAGCTG GCGCCGGTGG TCTGGGACGT GCCCGCGGGG
CACCGCCTGG CCGTGGTGGT GGACAGCATG GACGCGCGCT ACCAGGACGA GAGCGACATC
GGGGACCGGG TGAGCGTGAC CTCCCCCGAG GAGGCCCCCG CCCAGGTCAC GGTCCCGCTG
GCCTGA
 
Protein sequence
MRPARTPFRP APCEDGPPRT GSGLRAAARL LPRLLSVALA AVLAAAVLSP APASAAVGAS 
IGYETIVSHD GTELRAKVIT PTGVEGPHPL LVMPSAWGTP HLLYVGAAAR LAHESGYQVV
AYTSRGFWDS GGGIGVAGPE DRADASAVID WALENTDADP DRIGMAGISY GGGISLLTAA
EDDRIRAVAS LSGWADLAVS LYPNETVDTQ SAELLLLAGH LTGTPGEELA GIEAAYRRGD
IQPALDVAPD RSAATKVDAI NANGTAVMMA HAWNDGIFPV GHITDFYEEL EVPKRLMITP
GDHATQELFG AAGLPNEVWE ALGDWFDHHL RGQANEVDAE GPVHVRPNNG RGGWTTHPDW
PSVTASTDRY YLAEPSTDWF RWQSSGAMEP EPSTGWDYRF RTGIATPAES GTVMLSGALQ
QFFDIPTGVL LPTVDRYRAA VWTSPAYPEG ARVAGTPEVS LTFTPTAPEQ SVYVYLYAID
GRGTGSLLSH APHTLRGATP GEPVTVDTEL APVVWDVPAG HRLAVVVDSM DARYQDESDI
GDRVSVTSPE EAPAQVTVPL A
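
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the gene sequence relates to the protein sequence, the code below joins the blocks, measures the G+C fraction, and translates codons. It uses only the first line of the gene sequence as an excerpt, so the G+C value it reports describes that excerpt and not the whole gene. The codon table is the standard amino acid assignment, which translation table 11 shares; the special handling of alternative start codons in table 11 is not implemented because this gene begins with ATG. All function names are illustrative.

```python
# Standard codon-to-amino-acid assignment (shared by translation table 11),
# written in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Excerpt: first line of the gene sequence shown above.
excerpt = clean(
    "ATGCGCCCTG CCAGAACCCC CTTCCGCCCC GCGCCCTGCG AGGACGGACC GCCCCGGACC"
)
print(len(excerpt), "bases")
print(f"G+C fraction of excerpt: {gc_fraction(excerpt):.2f}")
print(translate(excerpt))
```

The translated excerpt corresponds to the first 20 residues of the protein sequence (MRPARTPFRP APCEDGPPRT). Applying the same functions to the full gene sequence would be the way to check the GC content and Protein Length fields of the record.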