Gene Ndas_3886 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3886 
Symbol 
ID9247757 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4657174 
End bp4658382 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content73% 
IMG OID 
ProductAbortive infection protein 
Protein accessionYP_003681789 
Protein GI297562815 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.539919 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAAATG TGCCGCCACC GCCTGGTTCG TCCTGGCCCG CCGCGCCCGA GGGGCCGGAG 
GGGCGGACTG TCGGGGCCTA CACGCAGGAG CAGACCTGGG CGGATCCTCC CGGAGGGGGG
AGGCTGACGC CGCCCGGGGG CTACGCCGTG GCCGCCACCG ACCCGGACGT GTGGGCCTGG
CCGCCGCCGA GGCCGGTGCG CAGGAACCAC TTCGCCCGCG CGGAACTCCC CGCGGACGAG
TCCTACCACA GGCTGGGCCG CACGTCCCGC TTCCGCTGGT GGTTCACGCC GCTCACCCTG
GCCGTCCTCG CGGTGCTGCT CTTCTTCCTG TGGATAGCCG TCATCCTGTC GGTGACGATC
GTGGCGATCA TCAACGGCAG CGGACTGGCC CCGGACTCGG TCACCCTGAG CGTGATCGCC
GAGATGGCCT TCGGGCTCCT GTCGTCGGCG CTGTTCATCC CCATCGTCCT CTTCCTGGTC
CGCGTCGTGC AGTGGCGCCG GACCGGCTCC CTGTTCTCCG TCGAGGGACG GCTGCGCTGG
GGGTGGCTGG CCCGCTGCAC GGCGGTGGCG GTCGTCCCGG TCGCCCTGTG CGTCGTGGCC
TTCCTGCCGC TGGCCGAACT CCTCCAGCCC GACCTGGTGC CCCGCGAGGC CTCCGGCGGA
ACCGAGGTGT TCGCCGCGGC GATGACCGCC ATCGTGCTGC TGGTGCCGTT GCAGTCGGCG
GCCGAGGAGC TGACCCTGCG CGGCATGCTC ATGCAGCTGG TGGGCGCGCT CGGCGCCCGT
CCCGACGAGC GGCGCGGCCG GTCGGCGGTC TCGCGGGTCC TGCGCTCCCC GGGTCCGGCC
ATCCTGGCCA GCGGAACCCT GTTCGCGGCG CTGTACCTGG CCACGCACCC GGGCGACCCC
TGGACGACCG CGGCGCTCGC GGTGATGGGG CTGGGGATGT CCTGGCTGAC CTGGCGCACC
GGCGGTCTGG AGGCGGCGAT CAGCCTGCAC GTGGTCAACA GCCTCGTGCA GTTCACGCTG
TGCGTGTTCG AGGGCCGCAT GGAGCAGATC GGCACGGGGG TCATGGTCGG CTCCGGCCTG
CCCCTGGGCG CGGGTACGCC GCTGGTGCTG GTGCTGACCC TGATCCAGGT GGGGCTGTAC
GTGCTGGCGG TGGTGTGGCT GGCGGGCCGC CGCGGGGTGC GGCGCAGGAG CGCGGCCGCC
GTCCGTTAG
 
Protein sequence
MGNVPPPPGS SWPAAPEGPE GRTVGAYTQE QTWADPPGGG RLTPPGGYAV AATDPDVWAW 
PPPRPVRRNH FARAELPADE SYHRLGRTSR FRWWFTPLTL AVLAVLLFFL WIAVILSVTI
VAIINGSGLA PDSVTLSVIA EMAFGLLSSA LFIPIVLFLV RVVQWRRTGS LFSVEGRLRW
GWLARCTAVA VVPVALCVVA FLPLAELLQP DLVPREASGG TEVFAAAMTA IVLLVPLQSA
AEELTLRGML MQLVGALGAR PDERRGRSAV SRVLRSPGPA ILASGTLFAA LYLATHPGDP
WTTAALAVMG LGMSWLTWRT GGLEAAISLH VVNSLVQFTL CVFEGRMEQI GTGVMVGSGL
PLGAGTPLVL VLTLIQVGLY VLAVVWLAGR RGVRRRSAAA VR