Gene Ndas_4727 details


Gene Information

Locus tag: Ndas_4727
Symbol:
ID: 9248609
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 5610874
End bp: 5612124
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: protein of unknown function DUF1205
Protein accession: YP_003682619
Protein GI: 297563645
COG category:
COG ID:
TIGRFAM ID:
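
The reported lengths follow from the coordinates by simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(5610874, 5612124)
print(length)                     # 1251
print(protein_length_aa(length))  # 416
```

Both results agree with the Gene Length and Protein Length fields above.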


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGGTGC TGTTCGCGAC GCAGGCGGAG CGGACCCACT TCCTGGGGCT GGTGCCCCTG 
GCGTGGGCGC TGCGCGCCGC GGGCCACGAG GTCCGGGTGG CCAGCCAGCC CCATCTGGAG
TCCGTCGCGG CGGGGGCGGG GCTGCCCTTC ACCGCCGTGG GCCGCGACCA CCACTTCGCC
AGGATCAGCC GGGACGGCCG GGCGGACCGG CACCTGGACT TCGACATGGC CGAGGACCGC
GACGAGGTCC TGACCTGGGA CCACCTCCTC CAGGGCTACC GCGGGGTGGT CACCTGGTGG
TGGCGGACGG TCAACGACCC CATGTTCGAC GACCTGGTCG CCCTCTGCCG CGAGTGGCGC
CCCCACCTGG TCGTGTGGGA GCCGTCCACC TACGCCGCCC CCGTGGCGGC CCAGGCGTGC
GGTGCCGCAC ACGTGCGCCA CCTGTGGAGC GTGGACCTGT TCAGCAGGGT CCGCCGGACC
TTCCTGGCGC GGATGGGCGA GCAGCCCGCC TCACAGCGGG AGGACCCCCT GGCCGCGTGG
CTGGGGACCA GGGCGGCCCG GTACGGCGTG GACTTCTCCG AGACCCTGGT CCACGGCCAG
GCCACCGTCG AGCAGGTGCC CTCCCCGCTC AGGGTGGACA CGCCCGCGCA CCTGGAGTAC
CTGCCGGTGC GCTACGTGCC CTACAACGGA CGCGCCGTCG TCCCCCACTG GCTGCGTACA
CAACCCGACC GCCCCCGGAT CGGACTCAGC CTCGGCACGA CGGCGGCGTT GCGCCTGGGC
GGCTACACGG TCGACGTCGC GACCCTCCTG GAGGGTCTGG CCGAGCTGGA CGTGGAGGTG
GTGGCCACCC TGCCCGCCAG TGAGCAGGCC AAGCTCGGCG CCGTCCCCGG CAACGCCCGC
CTGGTCGAGT ACGTGCCCCT GCACGCCCTG GCCCCCACCT GCGCCGCCAT GGTCACCCAC
GGCGGCCCCG GCACCGTCCT GACCGGCCTC GCCCACGGAG TCCCCCAACT CCTGTCACCC
AACGCCCGGA TGTTCGACAT CCCGGTCCTC GCGGGGCTGG TGGAGGAGGC CGGGGCGGGC
AGGGTCGTGG ACCCCGACCG CCTGGACGCC GCCACCGTCG CCGCAGGCGT GCGCACCCTC
CTGGAGGACC CCCGCCACAC AAGCGCCGCC CGCGCCCTGC GCGCACGCAT GGACGCCATG
CCCACCCCCG CCGACCTCGC CCACACCCTC GCCGGCCTCA CCCGCACCTG A
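
A GC content figure such as the one listed under Gene Information can be recomputed from a sequence printed in this layout (blocks of ten bases separated by spaces). The following is a minimal sketch; `gc_fraction` is a hypothetical helper, not something provided by the source database, and it simply counts G and C over all bases after stripping whitespace.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a DNA string; whitespace is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return gc_count / len(bases)


# First block of the gene sequence above: 5 G + 1 C out of 10 bases.
print(gc_fraction("ATGAGGGTGC"))
```

Pasting the full gene sequence into the call gives the whole-gene value, which can then be compared with the reported percentage.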
 
Protein sequence
MRVLFATQAE RTHFLGLVPL AWALRAAGHE VRVASQPHLE SVAAGAGLPF TAVGRDHHFA 
RISRDGRADR HLDFDMAEDR DEVLTWDHLL QGYRGVVTWW WRTVNDPMFD DLVALCREWR
PHLVVWEPST YAAPVAAQAC GAAHVRHLWS VDLFSRVRRT FLARMGEQPA SQREDPLAAW
LGTRAARYGV DFSETLVHGQ ATVEQVPSPL RVDTPAHLEY LPVRYVPYNG RAVVPHWLRT
QPDRPRIGLS LGTTAALRLG GYTVDVATLL EGLAELDVEV VATLPASEQA KLGAVPGNAR
LVEYVPLHAL APTCAAMVTH GGPGTVLTGL AHGVPQLLSP NARMFDIPVL AGLVEEAGAG
RVVDPDRLDA ATVAAGVRTL LEDPRHTSAA RALRARMDAM PTPADLAHTL AGLTRT
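
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. As a rough sketch of that relationship, the code below builds the codon table from the usual TCAG ordering and translates until the first stop codon. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its permitted start codons; since this gene begins with ATG, no alternative start handling is included. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG order (first, second, third base); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence up to (not including) the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for codons with non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAGGGTGC TGTTCGCGAC GCAGGCGGAG"))  # MRVLFATQAE
```

The output matches the first ten residues of the protein sequence listed above.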