Gene Ndas_1800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1800 
Symbol 
ID9245650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2200618 
End bp2201778 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content69% 
IMG OID 
Productputative transposase IS891/IS1136/IS1341 family 
Protein accessionYP_003679734 
Protein GI297560760 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.31256 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.500943 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGACGG GGTTTCGGTA CCGGCTCGCG CTCACCGACG AGCAGGCCGA GTCGTGCCAG 
GTCTATGGCG ATATCTGTCG GGCGGTGTGG AACACCGGCC TTGACCAGCG CCGTGAGGCC
GTGTCCCGTT GGCGGCGCGG CCAGCGCCTG CCCTACTGCG GCTACCACCT CCAGGCCCGT
GAACTGGCCG AGGCCAAGAC CGAGGAGACC TGGCTTCAGG CCGCCCCCTC CCACATCCTC
CAGCAGACCC TGCGCGACCT GGACCGGGCC TGCCGGGACC ACGGCACGTT CAAGGTGCGA
TGGCGTGCCA AGGACCGGTG GAAGCCCTCC TTCCGCTTCC CCGACAGCGC ATGGATGAAG
GTCGAACGCC TGGGCCGCAG GTGGGCACGG GTCGAGCTGC CCAAGCTGGG GTGGGTACGG
TTCCGCTGGT CGCGCGCACC CCGGGGCACG GTCCGTTCGG CCACCGTCTC CCGCGACGGG
GCCTACTGGT ACGTGTCGCT GTTGTGCGAG GACGGCCAGG CCACACCGGA GGCACATGAG
CGTCCCGACA GCGCGGTAGG TGTGGACCGG GGTGTGGCGG TGGCGGTGGC CACCAGCGAT
GGAGACCTGC TCGACCAGGT GTTCCAGACC CCGAAGGAGG CCGAGCGCGA ACGCCGTCTG
CGCCGACGCC TGTGCCGCCA ACGCAAGGGG TCGGCGAACC GGGCCAGAAC CAGGGCCGCC
CTGTCCGCGT TGACCGGGCG GGTGCGGGCT CGGCGCTCCG ATTTCGCCGC CCAGACCGCG
CACACGCTGT GTGCCAAGAA CGCGGTCGTG GTGCTGGAAA AGCTCAACAC CACGAACATG
ACGGCCTCCG CGAAAGGCAC TGTTGAGGTG CCCGGCGTCA GCGTGCGTCA GAAGGCGGGG
TTGAATCGGG CGATTCTGGC CAAGGGCTGG CACGGCCTCA AGCTGGCCTG TCACAACGCG
GCCCGGCGCA CCGGGACCCG GATCGTGGAG GTTGATCCCG CGTACACGTC CCAGACCTGT
CACTCGTGCG GATACGTCGC GGCGGAGAAC CGAGAGAGCC AATCGGTCTT CTGCTGCGGC
AGGTGCGGGC ACACAGCGCA TGCGGACGTG AACGCGGCCC AGAACATTCT CACGCGCGGA
TGGACTAGCC CTTCGGGGTG A
 
Protein sequence
MLTGFRYRLA LTDEQAESCQ VYGDICRAVW NTGLDQRREA VSRWRRGQRL PYCGYHLQAR 
ELAEAKTEET WLQAAPSHIL QQTLRDLDRA CRDHGTFKVR WRAKDRWKPS FRFPDSAWMK
VERLGRRWAR VELPKLGWVR FRWSRAPRGT VRSATVSRDG AYWYVSLLCE DGQATPEAHE
RPDSAVGVDR GVAVAVATSD GDLLDQVFQT PKEAERERRL RRRLCRQRKG SANRARTRAA
LSALTGRVRA RRSDFAAQTA HTLCAKNAVV VLEKLNTTNM TASAKGTVEV PGVSVRQKAG
LNRAILAKGW HGLKLACHNA ARRTGTRIVE VDPAYTSQTC HSCGYVAAEN RESQSVFCCG
RCGHTAHADV NAAQNILTRG WTSPSG