Gene Ndas_2140 details

Gene Information

Locus tag: Ndas_2140
Symbol:
ID: 9245990
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2559397
End bp: 2562858
Gene Length: 3462 bp
Protein Length: 1153 aa
Translation table: 11
GC content: 80%
IMG OID:
Product: transcriptional regulator, SARP family
Protein accession: YP_003680068
Protein GI: 297561094
COG category:
COG ID:
TIGRFAM ID:
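
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, that the CDS is unspliced (as the record states), and that the gene length includes the terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2559397, 2562858)
print(length)                     # 3462
print(protein_length_aa(length))  # 1153
```

Both results match the Gene Length and Protein Length fields listed in the record.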


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.032223
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGGGAG AGGCGACGCA GGCGGCGGTG TCCTTCCGCG TGCTGGGCCC TCTGGAGGCC 
GTCGGCGCGC ACGGGCCGCT CGCCCTCAAG GGGCCGCGGC ACCGCGCGGT GCTGGCCCGG
CTCCTGGTCG CCGAGGGCCG TGCCGTGCCC GTGGACCGCC TCGTGGACGA CCTGTGGGAG
GCGCCCGCCG AGGGCTCCGT GGCGGCGGTG CGCACCTTCG TCTCCGCGCT GCGCCGGTCC
CTGGAACCGG ACCGGCCCGC GCGGCGACCC GCCCGGCTGC TGGTGACCGC GCCACCGGGT
TACGCGCTGC GGGCCGGACC CGACGCGGTC GACGCCCGGC GCTTCGCCGC GGCCGTGGCC
CGCGGCGGAG CGCTGCTCAC CGAGGACCGG CCCGGAGCGG CGCTGGACGG GCTGGAGGAG
GCCCTCGGGT GGTGGCGGGG GCCCGCCTAC GCCGAGTTCG CCGCCTACCC CTGGGCGCGG
GCCGAGGCCG ACCGGCTGGA GGGGCTGCGG CTGCTGGCGG TGGAGCGGCA CGCCGAGGCC
CTCCTGGCGC TGGGCCGTGC GGGCGACGCC GTGCCCGCCC TGGAGGCGCA CGCCCTGGCC
CACCCCCTGC GCGAGAACGC CTGGCGGCAG TGGGCGCTGG CCCTGTACCG CTCGGGCCGC
CAGGGTGACG CGCTGGCCGC CCTGCGCCGC GCCCGCCGGA CCCTGGCGGA CGAACTGGGG
GTGGACCCCG GCCCCGAACT GCGGCGGTTG GAGTCCGACG TCCTGGCCCA GGCTCCCCAC
CTCATCCCGC GTGCCGCCAC CGCGGTTCCG GCACGGGCGG AGAGCGCGCC CGAACGCGCC
CAGCGTCCCT TCGTGGGCCG GGCCCGGGAG CTGGAACTGC TGGAGGGGGC CGCCGCCTCC
GCCGCGCCGC GCGCTCCGGC CCGGGTCGCG CTGGTCTCCG GCGACGCCGG GGCGGGTAAG
ACGGCCCTGG CCGAGGAACT CGCGCGGCGG CTGGCCGGGC GCGGGTGGAC CGCGGCCTGG
GGGCGCGGCC CCGAGCACGA GGGGGCGCCC GCGGTCTGGC CCTGGACGCA GATCGCCGCC
GTGCTGACGG CGGCCGCGGA CACCGCGACC CACGGGGTCG CCACCGGCCC GGCACAGGCC
GCGGAACACG CCGCCGACCC CGGCGCCCCG CGCGCTGCCA CCACCGACCC CACCGGCACC
GGATCCGGGG CCCCGCGCAT CGATACCGCC ACGGCCACGG GCATCACAGC GCCCAGCGGC
ACCGCTGCTC CCGCCCGGGA CGACGACACC ACAGCGCCCG GGGTCGCCAG CGCGGACCCC
GCCGCGGCCC GGTTCGGCGT CCTGCGCCGG GCCGCCCGCC TCCTCGCCTC GGCCACGCGG
CGCGGCCCCG TCCTGCTGGT CTTCGACGAC CTGCACCGGG CGGCGGAGGA GACCCTGGAA
CTGTTCACCT TCCTGGCCGC CGAACCGCTC CCCGGGCCCG TGCTGCTCGT GGGGACCTAC
CGGACCGGCG AGGTCCTGCC CGCGCTGACC GCGGCCCTGG CCCGGCTCGC CCCCGGCGAA
CCGGCCCGCG CCTACCTGGG CGGGCTGGCC GAGGACGCGA CCGGCGCGCT CGTACGGGCC
CTGGTCGGCC GCGACGTCGA CGGGCGGGCG CTGCGCACCG TCCACCGGCG CAGCGGCGGC
AACCCCTTCT TCGTCCGTGA GCTGGCCCGG TTGCTGTCCG ACGGGGACGG CGCCGCGCTC
GACGCGGTCC CCGCCGGGGT GCGCGACGTC ATCCGCCACC GCCTGGGCTC CCTCACGCCC
GGGGCCCGGG CCCTGCTGCG GCAGGCCTCG GTGATCGGCC GCGACATCGA CCCCGACGTC
CTGTCGGCCC TGTCCCCGGA CGGGGACGCC CTGCTCGACG CCCTGGACGA GGCGCTGGAG
GCGGGCTTCC TCACCGACCG CGCCGAACCG GACGAGCCCG ACGGCGCCCC CGGCCTGCGC
TTCGCGCACG TGCTGGTGCG CGACACCCTC TACCAGGACC TCTCCCGACC GCGCCGGGCG
CACTGGCACA CGGCGGTGGC CGAGGCCGTC GAGGCGCTGC ACCCCGACCG GGCCGACCTG
CTCGCGCACC ACTTCGGGCG GGCGGGAACC CGCGCCACCG CGACCCGGGC CGCCCACCAC
GCCCGCACAG CCGCCCTGCG GGCCGAGGAG CGCTTCGCCC CGCACGAGGC GGCCCGGCTG
TGGCGGGAGG CCGTCGCCGC CCACGACCGC TCCGGCGAGG ACCGGCCCCG CGAACGCCTG
GAGGCGGTCA TGGGCATGGT GCGGGCGCTG GCCGTCACCG GCCGCCTGGA GGAGGCCCGC
CACCACCGCG CCCGGGCGGT CGCCGCGGCC GAGGAGCTGG GTGACGCGGA ACTGACGGCG
CAGGTCATCA CCGCCTTCGA CGTGCCCGCC CTGTGGCCGC GCAACGACGA CGAGGAGCTG
GCCCGCCAGA TCGCCGGGGC CGCCGAGCGC ACCCTGGCCG CGCTGCCCGA GGACCGCCCC
GAGCAGCGCG TCCGCCTGCT GTGCGCCCTG GCCCTGGAAC TGCGCGGCAC CGCCACCGGC
CGGGGCCCGG ACGCGGCCCG CCGGGCCGAG GAGGCCGCCC GCGGGCTCGG CGACCCCGCC
CTGCTGTGCC TGGCGCTCAA CGCCCGCTTC ATGCAGTCCT TCCAGGGCTC CGGGCGGGCC
CCGCAGCGCG TGGAGATCGG CGCGGAGCTG GTCGACCTGG CCTCTCGGCA CGGCCTGGTG
ACCTTCGAGG TGCTGGGCCA CCTCGTCCTG GTCCAGGCGC ACTCCGCGCT CGCCGACTTC
GGTGCCGCCG ACGCCCACGC GGCCGACGCC GACCGGCTGG GCTCGCGCTA CGGGATTCCG
CTGGTGGGGG TGTTCACCCG CTGGTACGAG GCACTGCGCA CGGCCGCGCG GGGGGCCGTC
GAGGAGGCCG AGGCCGCCTA CCGTGCCGCG AGCGTGCGGC TCGCCGACAC CGGCATGCCC
GGTGTGGAAC AGGGCATCCT GCCCCTGGCG CTGCTGTGTC TGCGCCTTCA GGGCGGCCGA
CCCGCACCGG TGGACCCGCG CCAGGACTGG GGCCCCTACG CGCCCTGGGC CGACGCCCTG
GCCTCGCCCG AGTCCGCGCC CCCGCCGCCC GACGCACCGC CCGGCCTGCT CGGGGAGGCT
CTGACCTGTC TGGCCGCCCG GGCCGCCACC GCCGTCGGCG ACCACCCCGC GATGGAGCGC
GCCCACCGCC TGCTCTCACC CGCCGCCGGG GAGCTGGCCG GTGCGGGCAG CGGTCTGTTG
ACCCTGGGAC CGGTCGCGCA CCAGCTCGGC GACCTCGACC GCGCGCTCGG ACGCCGTGAG
CGGGCCGCGG AGCACTACCG GCTGGCCCTG CGCGTGGCCA TCCGGGCCGG GTCGCCGCAC
TGGACAGCCG CGGCCCGGGC GGCCCTGGCC GACCTGGGCT GA
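
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be joined before it can be used. The following is a minimal sketch of that cleanup together with two simple sanity checks (GC fraction and CDS shape). The helper names are hypothetical, and the start-codon set is only a commonly used subset of the initiators allowed under translation table 11, not the full list.

```python
# Assumed subset of start codons; translation table 11 permits others as well.
COMMON_STARTS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(grouped: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the sequence has a start and a stop codon."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in COMMON_STARTS
        and seq[-3:] in STOP_CODONS
    )


# Toy input in the same grouped layout (not the real gene).
toy = clean_sequence("ATGGCC\nGGCTGA")
print(toy)                           # ATGGCCGGCTGA
print(looks_like_complete_cds(toy))  # True
```

Passing the full gene sequence text to `clean_sequence` should give a string whose length can be compared against the Gene Length field.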
 
Protein sequence
MGGEATQAAV SFRVLGPLEA VGAHGPLALK GPRHRAVLAR LLVAEGRAVP VDRLVDDLWE 
APAEGSVAAV RTFVSALRRS LEPDRPARRP ARLLVTAPPG YALRAGPDAV DARRFAAAVA
RGGALLTEDR PGAALDGLEE ALGWWRGPAY AEFAAYPWAR AEADRLEGLR LLAVERHAEA
LLALGRAGDA VPALEAHALA HPLRENAWRQ WALALYRSGR QGDALAALRR ARRTLADELG
VDPGPELRRL ESDVLAQAPH LIPRAATAVP ARAESAPERA QRPFVGRARE LELLEGAAAS
AAPRAPARVA LVSGDAGAGK TALAEELARR LAGRGWTAAW GRGPEHEGAP AVWPWTQIAA
VLTAAADTAT HGVATGPAQA AEHAADPGAP RAATTDPTGT GSGAPRIDTA TATGITAPSG
TAAPARDDDT TAPGVASADP AAARFGVLRR AARLLASATR RGPVLLVFDD LHRAAEETLE
LFTFLAAEPL PGPVLLVGTY RTGEVLPALT AALARLAPGE PARAYLGGLA EDATGALVRA
LVGRDVDGRA LRTVHRRSGG NPFFVRELAR LLSDGDGAAL DAVPAGVRDV IRHRLGSLTP
GARALLRQAS VIGRDIDPDV LSALSPDGDA LLDALDEALE AGFLTDRAEP DEPDGAPGLR
FAHVLVRDTL YQDLSRPRRA HWHTAVAEAV EALHPDRADL LAHHFGRAGT RATATRAAHH
ARTAALRAEE RFAPHEAARL WREAVAAHDR SGEDRPRERL EAVMGMVRAL AVTGRLEEAR
HHRARAVAAA EELGDAELTA QVITAFDVPA LWPRNDDEEL ARQIAGAAER TLAALPEDRP
EQRVRLLCAL ALELRGTATG RGPDAARRAE EAARGLGDPA LLCLALNARF MQSFQGSGRA
PQRVEIGAEL VDLASRHGLV TFEVLGHLVL VQAHSALADF GAADAHAADA DRLGSRYGIP
LVGVFTRWYE ALRTAARGAV EEAEAAYRAA SVRLADTGMP GVEQGILPLA LLCLRLQGGR
PAPVDPRQDW GPYAPWADAL ASPESAPPPP DAPPGLLGEA LTCLAARAAT AVGDHPAMER
AHRLLSPAAG ELAGAGSGLL TLGPVAHQLG DLDRALGRRE RAAEHYRLAL RVAIRAGSPH
WTAAARAALA DLG