Gene Ndas_0321 details

Gene Information

Locus tag: Ndas_0321
Symbol:
ID: 9244156
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 397922
End bp: 399910
Gene Length: 1989 bp
Protein Length: 662 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: transcription termination factor Rho
Protein accession: YP_003678275
Protein GI: 297559301
COG category:
COG ID:
TIGRFAM ID:
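
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(397922, 399910)
print(length_bp)                  # 1989
print(protein_length(length_bp))  # 662
```

Under these assumptions the computed values agree with the Gene Length (1989 bp) and Protein Length (662 aa) fields of this record.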


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.440265
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGACA CCACCGAACT CCGAACGGAC GCGGCGGTCG AAGACAAAGC CACCACCGGC 
GTTTCCGTGA CGGAGGCTGG CGATGGCGCG CCGACAGCCA CGCCCTCGCG GTCCCGCACG
GCGTCGCGCG GCACCGGTCT CGCGGCCCTG AAGCTCCCCG AGCTCCAGAA GCTCGCGTCC
AGCCTGGGTA TCACCGGTAC GGGGCGCATG CGCAAGAGCG ACGTCATCGC CGCGATCGAG
GCCAAGCAGG GCGGCCCGGT CGGCGGCCCC ACGAAGGCCA AAACAGCAAA GAGCGCAGAG
ACGGCGTCCA AGGCCGGTAA GGTCGAGGCA ACCGACGGCC GGGCCGAGGC TCCCGAGACG
CCGGGCACGG ACGAGGCCCC GCGCAAGCGC GCGGACAAGC AGCCGTCGGA CCAGGCCGTG
ACCGACCCCC AGGGCGGACG GGGCGACAAG CCCTCCCGCG GTCGTCGTTC GTCCCGCCGA
CGCGGTGACG AGCCCGGCGA CCAGCCCCGG GCCGTGGACG GCGCCGAATC CTCTACCTCC
GCGAGCAGCG TGACCAAGAC ATCCAGCACC CCCCAGAAGA ACGGCCCCGA CTCCGCCGAG
GACCGTGACA ACAGGTCCGG GCAGGGTCGC GAGCGCCAGC GCAACCGCCG CAACCGCAAC
CGCGGCGGTG ACGACCAGAA CGCGAACAGC AACGCCCAGC AGGGTGGTGG CGGCCAGAAC
CAGGGCCGCG GCTCCGGTGG TGGCGGTGGC GACGACGACG ACTTCGGCGG ACGCCGCCGC
GGACGCCGCC GGGACCGTCG GGACCGCCGC GGACGCGGCG GGGGCCAGGA GCCGGAGCCG
GTGATCGGCG AGGACGACGT CCTGCTGCCG GTCGCGGGCA TCCTCGACAT CCTGGACAAC
TACGCCTTCG TGCGCACCAC CGGCTACCTC CCCGGCCAGA GCGACGTCTA CGTCTCCCTG
GCCCAGGTCC GCAAGCACGG CCTGCGCAAG GGCGACCACA TCATCGGCGC GGTCCGCCAG
CCCAAGGACG GCGAGCGCAG GGAGAAGTTC AACGCCCTGG TCCGCCTGGA CTCGGTCAAC
GGCATGTCGC CCGACCAGGC CAGGGGCCGC CAGGAGTTCT CCAAGCTGGT CCCCCTGTAC
CCCCAGGAGC GCCTGCGCCT GGAGACCGAG CCGCAGATCC TCACCACGCG CATCATCGAC
CTGGTGGCGC CCATCGGCAA GGGCCAGCGC GGGCTGATCG TCTCCCCGCC CAAGGCGGGC
AAGACGATGG TGGTGCAGGC GATCGCCAAC GCCATCACCG AGAACAACCC CGAGTGCTAC
CTGATGGTGA TCCTGGTCGA CGAGCGGCCC GAGGAAGTCA CCGACATGCA GCGCACGGTC
AAGGGCGAGG TCATCCACTC GACCTTCGAC CGGCCCGCCG AGGACCACAC GGTCGTCGCC
GACCTGGCCA TCGAGCGCGC CAAGCGGCTC GTGGAGATGG GCATGGACGT CGTCGTCCTG
CTGGACTCCA TCACCCGCCT GGGCCGCGCC TACAACCTGG CCGCCCCGGC CAGCGGGCGC
ATCATGTCCG GCGGTGTGGA CTCCACGGCG CTCTACCCGC CCAAGCGCTT CTTCGGCGCG
GCCCGCAACA TCGAGGGCGG CGGCTCGCTG ACCATCCTGG CCACGGCGCT GGTCGAGACC
GGCTCGCGCG CCGACGAGGT GATCTTCGAG GAGTTCAAGG GCACCGGCAA CATGGAGCTC
AAGCTCAACC GGAGCCTGGC CGACAAGCGG ATCTTCCCGG CGGTGGACGT GGACGCGTCC
AGCACCCGCA AGGAGGAGAT CCTCATGTCC TCCGAGGAGC TGGGCGTGGT CTGGAAGCTG
CGCCGGGTGC TGCACGCGCT CGACACCCAG CAGGCCATCG AGCTGCTCCT GGACAAGATG
AAGGAGTCCA AGAGCAACGC CGAGTTCCTG CTCCAGATCC AGAAGACCAC CGTGGGCCCC
GAGCGCTGA
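
The GC content field can be recomputed from a sequence laid out in space-separated blocks of ten bases, as in this record. The following is a minimal sketch; `gc_fraction` is a hypothetical helper name, and it assumes the input contains only A, C, G and T once whitespace is removed.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, used here only as a short demonstration.
first_row = "GTGAGCGACA CCACCGAACT CCGAACGGAC GCGGCGGTCG AAGACAAAGC CACCACCGGC"
print(f"{gc_fraction(first_row):.1%}")
```

Passing the complete gene sequence instead of its first row gives the value to compare against the GC content field.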
 
Protein sequence
MSDTTELRTD AAVEDKATTG VSVTEAGDGA PTATPSRSRT ASRGTGLAAL KLPELQKLAS 
SLGITGTGRM RKSDVIAAIE AKQGGPVGGP TKAKTAKSAE TASKAGKVEA TDGRAEAPET
PGTDEAPRKR ADKQPSDQAV TDPQGGRGDK PSRGRRSSRR RGDEPGDQPR AVDGAESSTS
ASSVTKTSST PQKNGPDSAE DRDNRSGQGR ERQRNRRNRN RGGDDQNANS NAQQGGGGQN
QGRGSGGGGG DDDDFGGRRR GRRRDRRDRR GRGGGQEPEP VIGEDDVLLP VAGILDILDN
YAFVRTTGYL PGQSDVYVSL AQVRKHGLRK GDHIIGAVRQ PKDGERREKF NALVRLDSVN
GMSPDQARGR QEFSKLVPLY PQERLRLETE PQILTTRIID LVAPIGKGQR GLIVSPPKAG
KTMVVQAIAN AITENNPECY LMVILVDERP EEVTDMQRTV KGEVIHSTFD RPAEDHTVVA
DLAIERAKRL VEMGMDVVVL LDSITRLGRA YNLAAPASGR IMSGGVDSTA LYPPKRFFGA
ARNIEGGGSL TILATALVET GSRADEVIFE EFKGTGNMEL KLNRSLADKR IFPAVDVDAS
STRKEEILMS SEELGVVWKL RRVLHALDTQ QAIELLLDKM KESKSNAEFL LQIQKTTVGP
ER
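
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch under stated assumptions, not the pipeline that produced this record: the amino-acid assignments are those of the standard genetic code, which translation table 11 shares, and `ALT_STARTS` is a simplified, hypothetical subset of the alternative start codons that table 11 allows, read as methionine only at the first position.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: simplified subset of table 11 alternative start codons.
ALT_STARTS = {"GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in ALT_STARTS:
            aa = "M"  # an alternative start codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate_cds("GTGAGCGACA CCACCGAACT CCGAACGGAC"))  # MSDTTELRTD
print(translate_cds("GAGCGCTGA"))                         # ER
```

The first call uses the first 30 bases of the gene sequence and reproduces the first block of the protein sequence, including the GTG start read as M; the second uses the final nine bases and reproduces the C-terminal residues before the TGA stop codon.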