Gene Ndas_4700 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4700 
Symbol 
ID9248582 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5579642 
End bp5580916 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content75% 
IMG OID 
Productprotein of unknown function DUF1205 
Protein accessionYP_003682592 
Protein GI297563618 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.626027 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.337723 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCGTCG TCGTGGCGTC CCTCGCCGAG AAGACGAACT TCCTGAGCCT GGTGCCCCTG 
GCGTGGGCGC TGCGCGCCGC CGGGCACGAG GTGCGGGTGG CCAGCCAGCC CGCGCTGGAG
CCCGTGGTGC GGGAGACGGG CGTGCCGTTC GTCGCGGTGG GGCGCGACCA CGGGTTCTGG
CGCCATCTCA CCGCCCGGTC CTCCTTCGAC GGGATGCGGG GAGGCGTCCC CCTCTTCTCC
GTGTACGGCC GGGGCGGGCC GGAGGGCTCC TGGGAGGAGA CCCTGGAGGA GTACCGGCAG
GTCGTCACCT GGTGGTGGCG GATGGTCAAC GACCCCATGG TCGACGACCT GGTCGCCCTC
TGCCGCGAGT GGCGCCCCGA CCTGGTCGTG TGGGAGCCCA TCACCTTCTC CGGGGCGATC
GCCGCCGAGG CCTGCGGGGC CGCGCACGTG CGCTATCCCT GGGGCGCGGA CGTGTTCGGC
GCCGTACGCG CGCGCTTCCT GGCGCGGATG GGCGAACAGC CCGCCTCACG GCGGGAGGAC
CCCCTGGCCG CGTGGCTGGG GACCAGGGCG GCCCGGTACG GCGTGGACTT CTCCGAGACC
CTGGTCCACG GCCAGGCCAC CGTCGAGCAG GTCCCCGCGT CCCTGCGGGT GGACACGCCC
GCGCACCTGG AGTACCTGCC GGTGCGCTAC GTGCCCTACA ACGGACGCGC CGTCGTCCCC
GAATGGCTGC GCACACCCCC CACCCGCCCC CGGGTGGCCC TGTGTCTGGG CACCAGCACG
GCGGCGTGGC TGGGCAGGTT CGGGGTGGAC GTGGCCACGG TTCTGGAGGG TCTGGCCGAG
CTGGACGTGG AGGTGGTGGC CACCCTGCCC GCCAGTGAGC AGGCCAAGCT CGGCGCCGTC
CCCGGCAACG CCCGCCTGGT CGAGTACGTG CCCCTGCACG CCCTGGCCCC CACCTGCGCC
GCCATGATCA CCCACGGCGG GACGGGCACC GTCCTGACCG GTCTGGCCCA CGGGGTCCCG
CAGCTCGTCT CGCCCCGGCC CACCTTCGAC GAACCCCTGC TGGCCTCGTC GGTCGCGGCC
GAGGGCGCGG CGCTGGTCGT GGACCCCGAC CGCATGGACG CCGCCACCGT CACCGCCGGC
GTACGCGCCC TCCTCGAAGA CCCCCGCCAC ACAAGCGCCG CCCGCGCCCT GCGCGCACGC
ATGGACGCCA TGCCCACCCC CGCCGACCTC GCCCACACCC TCACCGCGCG CCGGCCCGCA
TTCCGAAGCA AATGA
 
Protein sequence
MRVVVASLAE KTNFLSLVPL AWALRAAGHE VRVASQPALE PVVRETGVPF VAVGRDHGFW 
RHLTARSSFD GMRGGVPLFS VYGRGGPEGS WEETLEEYRQ VVTWWWRMVN DPMVDDLVAL
CREWRPDLVV WEPITFSGAI AAEACGAAHV RYPWGADVFG AVRARFLARM GEQPASRRED
PLAAWLGTRA ARYGVDFSET LVHGQATVEQ VPASLRVDTP AHLEYLPVRY VPYNGRAVVP
EWLRTPPTRP RVALCLGTST AAWLGRFGVD VATVLEGLAE LDVEVVATLP ASEQAKLGAV
PGNARLVEYV PLHALAPTCA AMITHGGTGT VLTGLAHGVP QLVSPRPTFD EPLLASSVAA
EGAALVVDPD RMDAATVTAG VRALLEDPRH TSAARALRAR MDAMPTPADL AHTLTARRPA
FRSK