Gene Ndas_4711 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4711 
Symbol 
ID9248593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5591296 
End bp5592585 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content75% 
IMG OID 
Productprotein of unknown function DUF1205 
Protein accessionYP_003682603 
Protein GI297563629 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.890895 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGTATCT TGTTCGCAAC GTTCTCCGAG AAGACCCACT TCATCGGGAT GACCCCCCTG 
GCATGGGCGC TGCGCGCCGC CGGGCACGAG GTGCGCGTCG CCAGCCAGCC CGAACTCGCG
CCGACGGTGG CCGCGACCGG GCTGCCGTTC GTCGCCGCGG GGTCGGACCA CGTGCTCCCC
CAGGTGATCG CCTGGGTCGG GCGCATGGCG CGGGACATGC GCCCCGACTT CGACATGATG
CGCGTGGCGG CTCCGGAGGT CCCCTCCGGG GAGGAGCTGC GGGCCGCCTA CCGCGACGTG
CTGGTGCCGC TGTGGTGGAA GGTCGTCAAC GACCCGATGC TGGAGGACCT GGTCGCCTTC
TGCCGCGAGT GGCGCCCCGA CCTGGTCGTG TGGGAGCCCA TCACCTTCTC CGCGGCGATC
GCCGCGGAGG CGTGCGGTGC GGCGCACGTG CGCTTCCTGT GGAGCCTGGA CCTGTTCGCC
GCGATGCGCG AACAGTACCT GCGCCACATG GAACGACAGC CCCCACAGGA ACGCGACGAC
CCCCTCGCCG CATGGCTGGG CGACCGCGCC GCCCGCCACG GCGTCGACTT CTCCGAAACC
CTCGTCCGCG GCCAGGCCAC CCTGGACTAC CTGCCCGCCT CCCTGGGCGT GCCCGCCCCC
ACCGGAGCCC GCCGCCTGCC CATCCGCTAC GTGCCCTACA ACGGACGCGC CGTCGTCCCC
GACTGGCTGC GCACACCCCC CACCCGCCCC CGCGTCTGCC TCAGCCTCGG GACGACGGCC
ACCCAGCGCC TGGGCGGCTA CACGGTCGAC GTCGCGACCC TCCTGGAGGG CCTGGCCGAC
CTGGACGTGG AGGTCGTGGC CACCCTGCCC GCCCGCGAGC AGGAGAAGCT GGGCGCCGTC
CCCGACAACG CCCGCCTGGT CGAGTACGTC CCCCTGCACG CCCTGACCCC CACCTGCGCC
GCCATGATCA CCCACGGCGG GGCGGGCACC GTGATGTCCG GCCTGGTGCA CGGGGTCCCG
CAGTCGGCCG TGCCGCACCA CATGTACGAC GAGCCCCTGC TGGCCTCACT GGTGGCCGCG
CAGGGCTCGG GGGTGGTCGT GGACCCCTCC CGGGTCACCC CCGAGGCCGT CCGGGAGAGC
ACCCGGAGGC TGCTGGAGGA CCCCTCCCAC GCCGAGGCGG CGCGACGCCT GCGCGGGGAG
GTGGACGCCA TGCCCTCCCC CGCCGAGGTC GCGCGCCGGC TGGCGCGGGC CGCGGGGGAG
GGCGGGCGGG TGGACCTCAC ACGGTGGTGA
 
Protein sequence
MRILFATFSE KTHFIGMTPL AWALRAAGHE VRVASQPELA PTVAATGLPF VAAGSDHVLP 
QVIAWVGRMA RDMRPDFDMM RVAAPEVPSG EELRAAYRDV LVPLWWKVVN DPMLEDLVAF
CREWRPDLVV WEPITFSAAI AAEACGAAHV RFLWSLDLFA AMREQYLRHM ERQPPQERDD
PLAAWLGDRA ARHGVDFSET LVRGQATLDY LPASLGVPAP TGARRLPIRY VPYNGRAVVP
DWLRTPPTRP RVCLSLGTTA TQRLGGYTVD VATLLEGLAD LDVEVVATLP AREQEKLGAV
PDNARLVEYV PLHALTPTCA AMITHGGAGT VMSGLVHGVP QSAVPHHMYD EPLLASLVAA
QGSGVVVDPS RVTPEAVRES TRRLLEDPSH AEAARRLRGE VDAMPSPAEV ARRLARAAGE
GGRVDLTRW