Gene Ndas_1779 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1779 
Symbol 
ID9245629 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2177271 
End bp2178389 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content66% 
IMG OID 
Producttransposase, IS605 OrfB family 
Protein accessionYP_003679713 
Protein GI297560739 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.238729 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.359878 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGGGCG TGTCGAAGAT CGTCGTGCAG CTCAAGCTCA CGCCGTCGCC CGAGCAGGCG 
GCGGTGCTGA CCTCGACCCT GCGCGATCTC AACACCCACA CCACCTGGGT GGCGAGGGTG
GCCCACGAGC AGGGTGTGAT GCGCGACTAC GAGCTGCGCA AACATACCTA CCAGCAGTTG
CGTGAGGCCG GGGTGGGCTC GCAAGCGGCC CAGCACGTGA TCAAGAAGGT GTGCGACGCC
TACCACGCCC GCCGCTCCAA TCTGAACAAC GGCAACTACG GACCCCAAGG ATCGGCCCTC
CGGGAGCGGA TCGAATCCAC ACCGATCGCC TTCCGCCCCG ACTCGGCGCA CCCCTACGAC
GCGCGCGACC TGTCCTTCGC CATGGACGCG CGCACGATCT CACTGTGGAC CTTCCAGGGC
CGATTGAAGG ACGTGCCCTT CGTCGGCTCC CCCGACCAGA TCAAGATGCT GGCCGAACAC
AAACGCGGTG AGGCCGACCT GCTCTGCCGC GACGATGCCT GGTTTCTCGC GGTGACCGTC
GAGGTGCCCG ACGCTCCTGA GATCGACCCC AACGGGTTTC TTGGGGTGGA TCTGGGGATT
GTCAACATCG CCACCACCAG TGACGGCCGG GTCATGGCCG GGCGCCAGAT CAACCGGTAC
CGCCGTCGGC AGCTCAGGCT GCGCCAGAAG TTGCAGGCCA AGGGCAGCCG GTCCGCCAAG
CGCCTGCTCA ACAAGCGGCT CCGCCGTGAA GCACGGTACG CCAGAAACAT CAACCACCAG
ATCTCGAAAC GCATCGTGGC CGAGGCCGAA CGCACCGGGC GCGGTATCTC CCTTGAGGAT
CTCAGGGGGA TCCGCGCCCG GGTACGGCAA CGCAGGCCCC AACGGGTCAC GCTGCACTCC
TGGTCCTTCC ACCAACTGGG CGCCTTCATC GCCTACAAAG CGCGCCTGGA GGGCGTGCCG
GTGGTGTTCG TGGACCCGGC GCACTCCTCA CGCGAGTGCG CCGCATGCTC CTACACTCAC
AAGGCCAACC GGGTCTCACA GGCCTTGTTC GTCTGTCGGG ACTGCGGCGT CGTTGCGCAC
GCGGGGCGCA AGTCACACGT CCCACCCGAC CACCCCTAG
 
Protein sequence
MVGVSKIVVQ LKLTPSPEQA AVLTSTLRDL NTHTTWVARV AHEQGVMRDY ELRKHTYQQL 
REAGVGSQAA QHVIKKVCDA YHARRSNLNN GNYGPQGSAL RERIESTPIA FRPDSAHPYD
ARDLSFAMDA RTISLWTFQG RLKDVPFVGS PDQIKMLAEH KRGEADLLCR DDAWFLAVTV
EVPDAPEIDP NGFLGVDLGI VNIATTSDGR VMAGRQINRY RRRQLRLRQK LQAKGSRSAK
RLLNKRLRRE ARYARNINHQ ISKRIVAEAE RTGRGISLED LRGIRARVRQ RRPQRVTLHS
WSFHQLGAFI AYKARLEGVP VVFVDPAHSS RECAACSYTH KANRVSQALF VCRDCGVVAH
AGRKSHVPPD HP