Gene Ndas_4699 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4699 
Symbol 
ID9248581 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5578182 
End bp5579435 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content75% 
IMG OID 
Productprotein of unknown function DUF1205 
Protein accessionYP_003682591 
Protein GI297563617 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.363631 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.334984 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGTGC TCTTGGCCGC GGCCGCGGAG AAGCCGCACT TCCTCGGCAT GGTCCCGTTG 
GCGTGGGCGC TGCGCGCCGC GGGCCACGAG GTGCGGGTGG CCAGCCAGCC CGCGCTGGAG
CCCGTCGCGG CGGGGGCGGG CCTGCCCTTC ACCGCCGTCG GCAGGGACCA CGCCTTCTGG
CGCACGATGA AGGCCTTCGA CCTCCACGAC ACCCTCGACG ACGTCCCGCT GTTCGGCCGC
GTCACCGACC CCTACGAGCG GGTTCCCTGG GAGTACCTGC TGGAGGGCTA CCGCCGTGTG
GTGCCGTGGT GGTGGCGGAT GGTCAACGAC CCCATGGTCG ACGACCTGGT CGCCCTCTGC
CGCGAGTGGC GTCCCGAGCT GGTGGTCTGG GGTTCGGTGA GCTTCTCCGG GGCGATCGCC
GCCGAGGCGT GCGGGGCCGC GCACGTGCGC TACCTGTGGG GGGCCGACAT CTTCGCCCGC
ACCCGCGCGC GCTTCCTGGC GCGGATGGGC GAACAGCCCG CCTCACAGCG GGAGGACCCC
CTGGCCGCGT GGCTGGGGAC CAGGGCGGCC CGGTACGGCG TGGACTTCTC CGAGACCCTG
GTCCACGGCC AGGCCACCGT CGAGCAGGTC CCCGCGTCCC TGCGGGTGGA CACGCCCGCG
CACCTGGAGT ACCTGCCGGT GCGCTACGTG CCCTACAACG GACGCGCCGT CGTCCCCCAC
TGGCTGCGCA CACCCCCCAC CCGCCCCCGG GTCTGCGTCA CCCTGGGCAC CACCCTCATG
GGCCAGGACC GGGGCGGGGA CGTGTTCCGG GACCTCCTGG AGGGTCTGGC CGAGCTGGAC
GTGGAGGTCG TGGCCACCCT GCCCGCCCGC GAACAGGCCA AGCTCGGCAC CGTCCCCGGC
AACGCCCGCC TGGTCGAGTA CGTCCCCCTG CACGCCCTGG CCCCCACCTG CGCCGCCATG
GTCGACCACG GGGGGTGGGG GACCGTGCTC ACGGGACTGG ACGCGGGCGT GCCGCAGGTC
ATCGTCCCCA GCTGGTTCGA CGATCCGATG CTCGCCGACA TGCTCGCCGC GCGGGACGCG
GCCGTCTCCG TCCCCCACCG GACCATGACC GCCGGGGACG TCAGCACGGC GGTCTCCCGG
CTTCTGGAGG ACCCCGCCCT GGCCCGGGGC ACCGCGCGCG TGCGCGAGGC GATGCGCGCG
ATGCCCTCTC CGGCCGACCT CGCCGACGCG CTCGTCCGCC GGGCGGGGGG CTGA
 
Protein sequence
MRVLLAAAAE KPHFLGMVPL AWALRAAGHE VRVASQPALE PVAAGAGLPF TAVGRDHAFW 
RTMKAFDLHD TLDDVPLFGR VTDPYERVPW EYLLEGYRRV VPWWWRMVND PMVDDLVALC
REWRPELVVW GSVSFSGAIA AEACGAAHVR YLWGADIFAR TRARFLARMG EQPASQREDP
LAAWLGTRAA RYGVDFSETL VHGQATVEQV PASLRVDTPA HLEYLPVRYV PYNGRAVVPH
WLRTPPTRPR VCVTLGTTLM GQDRGGDVFR DLLEGLAELD VEVVATLPAR EQAKLGTVPG
NARLVEYVPL HALAPTCAAM VDHGGWGTVL TGLDAGVPQV IVPSWFDDPM LADMLAARDA
AVSVPHRTMT AGDVSTAVSR LLEDPALARG TARVREAMRA MPSPADLADA LVRRAGG