Gene Ndas_3147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3147 
Symbol 
ID9247003 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3765091 
End bp3766386 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content71% 
IMG OID 
ProductErfK/YbiS/YcfS/YnhG family protein 
Protein accessionYP_003681062 
Protein GI297562088 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGGGAC GAGTCCGCAC ACCCCGCGGT CTGGCGGCCG CCGTCGCGGT CGCGCTCGCC 
GCGACGGCCT GTACCGGCCC CGCCCGGCAG GAGCCCGCGG CGGCTCCCAG CGAGGAGGCC
CAGACCGGGC CGCGGATCAG CGTCACCCCC GAGGAGGGGG CCTCCTCCGT CGCCCCCGAC
ACCCCGGTGC GGGTGTCGGT GGAGGAGGGC TCGCTCACCG ACGTCCGCGT CGAGCAGGCC
CCCTCCGCGG AGGAGGGCGG GGACGCCGGG GCGGCCGGGG ACGCCGAGCG GTGGGAGTTC
ACCGGCACCC TCAGTGAGGA CGGCACCCGG TGGGTGAGCG ACTGGAACCT GGACCCGGGC
TCGGCGGTCA CCGTCCGCGC CACCGCCGAG GACGACGCCG GTGAGGCCTC CGAGACGGTC
GTGGAGTTCT CCACGAAGGA GGCCGTGCCC GGTCAGCGCC TCGAACTGGC CTCGAACTTC
CCCACCTCCG GCGACACCGT CGGCGTGGGC ATGCCGGTCA TCGTCAACTT CGACCTGCCG
GTGACCAACA AGGCCCAGGT GGAGAACTCC ATGGAGGTGA CCTCCGAGCA GGAGGTGGAG
GGCGCCTGGA ACTGGGTCGG CGACAAGACC GCGGTGTTTC GCCCCCGCGA GTACTGGGAG
CCCCACCAGC AGGTCAGCGT GGACATGCGC CTGTCGGGGG TGGAGGCCTC CGAGGGCGTC
TACGGGATCG AGAACCACCG CCTGGAGTTC GAGGTCGGCC GCGAGATGGT CTCGACCATG
CACGTGCCCG ACCACGAGAT GCTGGTGGAG ATCGACGGCG AGCCCGCGCG CACCATCCCC
GTGAGCAACG GCGAGGCCTC CAAGCGCTTC AACACCACCA CCTCGGGGAC GCACCTGACC
ATGGAGAAGT ACGAGTCCCT GGTCATGGAC GCGGCCACCC TGGGCATCCC CGAGGACTCG
CCGGACTACT ACAAGCTGGA CGTGGACTGG GCGGTGCGCA CCTCCAACAG CGGCGAGTTC
ACCCACGCCG CCCCCTGGAA CGACCGGATC GGGTCGGCCA ACACCTCCAA CGGCTGCACG
AACATGTCGG TGGAGGACGC CCGCTGGTTC TACGAGAACT CCCTGATGGG CGACGTCCTG
GAGACCACCG GGACCGACCG GGAGCTGGAG TGGGACAACG GCTGGGGTTT TTGGCAGCGG
TCCTGGGACG AGTGGCTGTC CCACAGCGCC ACCGGTGAGC CGCAGGTGAC CGACGGGTCG
GGCACCCCCG GTTCCGTGCA CGGCGAGGGG AACTAG
 
Protein sequence
MKGRVRTPRG LAAAVAVALA ATACTGPARQ EPAAAPSEEA QTGPRISVTP EEGASSVAPD 
TPVRVSVEEG SLTDVRVEQA PSAEEGGDAG AAGDAERWEF TGTLSEDGTR WVSDWNLDPG
SAVTVRATAE DDAGEASETV VEFSTKEAVP GQRLELASNF PTSGDTVGVG MPVIVNFDLP
VTNKAQVENS MEVTSEQEVE GAWNWVGDKT AVFRPREYWE PHQQVSVDMR LSGVEASEGV
YGIENHRLEF EVGREMVSTM HVPDHEMLVE IDGEPARTIP VSNGEASKRF NTTTSGTHLT
MEKYESLVMD AATLGIPEDS PDYYKLDVDW AVRTSNSGEF THAAPWNDRI GSANTSNGCT
NMSVEDARWF YENSLMGDVL ETTGTDRELE WDNGWGFWQR SWDEWLSHSA TGEPQVTDGS
GTPGSVHGEG N