Gene Ndas_2010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2010 
Symbol 
ID9245860 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2430209 
End bp2431660 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content78% 
IMG OID 
Producttranscriptional regulator, GntR family with aminotransferase domain 
Protein accessionYP_003679942 
Protein GI297560968 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.348528 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACAGC GTAGCAGTGT GCGGGAACTG ACCGTTTTCC TGCGCAGGGA GGTCGACCGC 
TACTCCCCCG GCGAAAAGCT GCCGTCGAGT CGGGCACTGG TGGAGCGCTA CGGCGTGAGC
CCGGTGACCG TGTCGCGGGC CGTCGCGGCG CTGGTCGCCG AGGGTGTGGT GGTCACCCGG
CCCGGCGCCG GGGCGTTCCG GGCGGGCGGG AGCGTCCCCG GGCGCGCGGT CGGCGACACC
TCCTGGCAGG AGGTCGCGCT GAGCGCCGAG ACGGCGGCGG CCTCCGGCGA ACCGGTCCCG
CGCACCGTGG ACGCCTCCGG GGTGCTGGCC ACCCTGACCC CGCCCGACCT GGGGGTCGTC
GTGTTCAACG GCGGCTACCT GCACCCCGCC CTCCAGCCCG AGCAGGCCAT GGGCGCCGCA
CTGGCCCGGG CGGGACGGCG GCCGGGAGCC TGGGCGCGCC CGCCGGCGGA AGGAGTGGAG
GAGCTGCGCG GCTGGTTCGC CCGGCAGATC GGCGGCTCCG TCGGCGCGGC CGACGTGCTG
GTCACCGCGG GCGGGCAGAG CGCGCTGACC ACCGCGCTGC GCGCCCTGGC CCACCCGGGG
GCGCCCGTTC TGGTGGAGTC GCCCACCTAC CCCGGTCTGC TGGCCGTCGC GCGTGCCGCG
GGCCTGCGGC CCGTCCCCGT CCCGGTGGAC GCCGAGGGGA TCCGCACCGA CCTGCTGGAG
CAGGCGTTCG CGGCCACCGG CGCCCGGGTG CTGGTGTGCC AGCCGCTCTT CCACAACCCC
ACCGGGACGG TCCTGGCCCC CGCCCGGCGG GGCGAGGTGG TGCGGACCGC GCGCGCCGCG
GGCGCTTTCG TGGTGGAGGA CGACTTCGCC CGCCACCTGG CGCACGCTGA CGCCGCGGCT
CCGCCGCCCC CGCTGGCGGC CGAGGACCCC GACGGCACCG TGGTGCACGT GCGGTCGCTG
ACCAAGGCGA CCTCGCCCAG CCTGCGCGTG GGTGCGATCG CCGCGCGGGG ACCGGTGACG
CGGCGCCTGC GCGCGATCCA GCTGGTGGAC AGCTTCTTCG TGGCGCGCCC CCTGCAGGAG
GCCGTGCTGG AGCTGGTCGG CTCGCCCGCG TGGGGCCGCC ACCTGCGCGC GGTGGCGGCG
GGCCTGCGCG AGCGCCGCAC GGCCATGGCC GCGGCCCTGG CGCGTGAACT GCCGGAACTG
GCGGCGCCGC ACCTGCCCGC GGGCGGACAC CACCTGTGGC TGCGGCTGCC CGGTGAGACC
GACGAGGCCG CGCTCGTGTC CGCGGCGCTG CGCGCCGGGG TGGCGGTGGC CGCAGGCCAG
GCCTACTTCC CGGCGGAGCC GACCGCCCCG CACCTGCGGC TCAGTTACGG CGGCGCGGCC
GGGACCGCCG AGATCACCGA GGGGGTGCGC CGCCTGCGCA CCGCGTTCGC CGGTACCGCA
CCCGGGGAGT GA
 
Protein sequence
MKQRSSVREL TVFLRREVDR YSPGEKLPSS RALVERYGVS PVTVSRAVAA LVAEGVVVTR 
PGAGAFRAGG SVPGRAVGDT SWQEVALSAE TAAASGEPVP RTVDASGVLA TLTPPDLGVV
VFNGGYLHPA LQPEQAMGAA LARAGRRPGA WARPPAEGVE ELRGWFARQI GGSVGAADVL
VTAGGQSALT TALRALAHPG APVLVESPTY PGLLAVARAA GLRPVPVPVD AEGIRTDLLE
QAFAATGARV LVCQPLFHNP TGTVLAPARR GEVVRTARAA GAFVVEDDFA RHLAHADAAA
PPPPLAAEDP DGTVVHVRSL TKATSPSLRV GAIAARGPVT RRLRAIQLVD SFFVARPLQE
AVLELVGSPA WGRHLRAVAA GLRERRTAMA AALARELPEL AAPHLPAGGH HLWLRLPGET
DEAALVSAAL RAGVAVAAGQ AYFPAEPTAP HLRLSYGGAA GTAEITEGVR RLRTAFAGTA
PGE