Gene Ndas_5204 details


Gene Information

Locus tag: Ndas_5204
Symbol:
ID: 9249097
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 354506
End bp: 355675
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: GAF domain protein
Protein accession: YP_003683090
Protein GI: 297564117
COG category:
COG ID:
TIGRFAM ID:
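
The length fields above follow from the coordinates. The sketch below is a minimal check, assuming that Start bp and End bp are inclusive 1-based positions and that the final codon of the CDS is a stop codon. Both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
# Coordinates copied from the record above.
START_BP = 354506
END_BP = 355675


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end - start + 1


def protein_length(gene_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp,", length_aa, "aa")  # 1170 bp, 389 aa
```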


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.412194
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCGGAGA ACCTCGCCGG GTTGCAGCGT GAGATCACCG CACTGGGTGA ACGCATCGCG 
GCCCTGCGCA ACACGCACAC CATGTATCCC GACGACGCCC AGGGCACCGC CGAGGCGGCC
CTGGCCGAAC TGGAGTACGC CGAACGCCTG CTGGGCGACG CGGGGGCCGA ACTGGCCCGG
GCCCACGCCC AGCCCGAGCC GCGCCGCCAG GGCGACGACG GCGACCGGGC CCTGCTCCGC
GCGATGTTCC AGGAGCTGAG CGTCCCCACC GTGCTGCTGG ACCACGAGGG CTACATCCGG
CGCATCAACA ACTCCGGGGC CGCGCGGCTG GGCAGCGCGC CGGGCTACCT CACCGGCAAG
CCCTTCGCCC ACTTCGTCGA CCTGCGCAAG CGCGCGGCCA TGCAGTCCTG GCTGGCCGCG
GTGCTGCGCG GCGACGGCGA CGCCGCGCTG GAGTCCCGGC TGGCCCAGCG CGGCTGGGCC
GAGGACGTGC ACCTGACCCT GACGCGCCTG GAGCTGCCCA CCGAGCCCAA CCCGCTGGTG
CTGGTGGCGA TGTCGCCGCC GATGAACGGC GCCGAGGAGG AGGGCCCCGC CCCGCTGGAG
ACGGAGGTGG AGGACCAGGT GGTGGTGCTG GCCGCGCGCA GGCTGGACGT GCTGACCCGG
ATGACGCGGC TGCTGCTGCG CTCGGCCGGC CCGGGCGGTG CGGGCGAACC GCTCGCGCTC
GCGGACGCGG CCGACCTGCT GGCCGACTCC TACGCCGACT GGGTGGTCGT GGACGTGTGC
GACCTGCCCA CCTCCTCCGT GGCGCCCCGC CGCGCCGTGG TCGCGGGGCC GGCCGACGCG
CTCCCGGCGC AGAGGGAGGC GGTGGCCTCG GCGGCGCCCG GCGACTCCGC CATCCCGGGC
GAGGTGCTGG AGCGGGGCCA GTCCCTGCTG TTCCCGCTGA TCGAGGACGA GGCCGTGCTG
GGCCACGCCC CGTCCGGAGC GCCGCTGCTG TCGATGCTGG GCGCGGGGTC GCTGTTGTCG
GTCCCCCTGC GGGGCAGCCG GGGCGTGCGC GGCGCGCTCA CCCTGATCCG CCGCAGCAAC
CGGGGCAGCT TCCGCCTGGC CGACCTGGGC CTGATCGAGG AGATCGGCGA GCACATCGGC
CTGGCCCTGC CGCCCCGGCC CTCCGCCTGA
 
Protein sequence
MSENLAGLQR EITALGERIA ALRNTHTMYP DDAQGTAEAA LAELEYAERL LGDAGAELAR 
AHAQPEPRRQ GDDGDRALLR AMFQELSVPT VLLDHEGYIR RINNSGAARL GSAPGYLTGK
PFAHFVDLRK RAAMQSWLAA VLRGDGDAAL ESRLAQRGWA EDVHLTLTRL ELPTEPNPLV
LVAMSPPMNG AEEEGPAPLE TEVEDQVVVL AARRLDVLTR MTRLLLRSAG PGGAGEPLAL
ADAADLLADS YADWVVVDVC DLPTSSVAPR RAVVAGPADA LPAQREAVAS AAPGDSAIPG
EVLERGQSLL FPLIEDEAVL GHAPSGAPLL SMLGAGSLLS VPLRGSRGVR GALTLIRRSN
RGSFRLADLG LIEEIGEHIG LALPPRPSA
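
The sequences above are printed in space-separated blocks of ten. The sketch below shows one way to join such blocks into a single string, compute GC content, and check the start and stop codons. The helper names are hypothetical. The start-codon set is a simplified assumption covering only the most common bacterial initiators and is not the full list for translation table 11. The gene starts with GTG while the protein starts with M, which is expected: an alternative start codon is still read as methionine at initiation.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Simplified assumption: only the most common bacterial start codons.
COMMON_STARTS = {"ATG", "GTG", "TTG"}
STOPS = {"TAA", "TAG", "TGA"}


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the end codons fit a CDS."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in COMMON_STARTS
        and seq[-3:] in STOPS
    )


# First and last lines of the gene sequence, copied from the record.
first_line = "GTGTCGGAGA ACCTCGCCGG GTTGCAGCGT GAGATCACCG CACTGGGTGA ACGCATCGCG"
last_line = "CTGGCCCTGC CGCCCCGGCC CTCCGCCTGA"

print(clean_sequence(first_line)[:3])   # first codon of the gene
print(clean_sequence(last_line)[-3:])   # last codon of the gene
```

To check the whole gene, pass the full gene sequence text to clean_sequence and then to gc_percent and looks_like_complete_cds.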