Gene Ndas_4601 details

Gene Information

Locus tag: Ndas_4601
Symbol:
ID: 9248482
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 5456573
End bp: 5457901
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003682493
Protein GI: 297563519
COG category:
COG ID:
TIGRFAM ID:
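
The start and end coordinates, the gene length, and the protein length listed above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information record above.
START_BP = 5456573
END_BP = 5457901


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1329 442
```

Under these assumptions the computed values agree with the listed Gene Length (1329 bp) and Protein Length (442 aa).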


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGTGA CCGTCGAGAA CGAAGGCGGA GGCACGTCGT CGGCGTACGC CTCCAGTACC 
GTCCTCCTGC CCTCTCCGCG GACACCGCGG CCCGGGGCGG GCAGGGACAG GTTCCTGGAC
GTCATCCGGC TGTTCGTGAT GGCCCTGGTG GTCATGCAGC ACTGGTGGCT GCCGGTCCTC
GTCCACGAGC CCGGGTCCCT CGAGGCGGGC AGCGTCCTGT CCACCGAGGG CGGCTTCGTG
CTGACGTGGG TCGTCCAGGT CATGCCGCTG ATCTTCTTCG TCGGGGGCGC GGCCAACCTG
ATCAGCTGGC GCTCGGCCTC GGGCCGGGGC ATGTCCGCCT CGGACTGGTT CGCGCGCAGA
CTGCGCAGAC TCGCCTGGCC GGTGGTGCCG CTGGCGGCCC TGTGGATCGT CGCCTCCCAC
CTGCTGGTCC TGGGCGGCGC ACCGGCCCAG GCCGTCCTGG TGGGCGCCGA GGCCGCGGGC
ATGGTGCTGT GGTTCCTCGC CGTGTACGTG CTGGTGGTCG TGTCCACCCC GCTGCTGTTC
CGGGCCCAGG AGCTCTTCGG CTGGTGGGTC CCGATCGCCC TGCTGGCCGC CGCCGCGGCC
GTGGACCTGA CCCGCTTCTC CACCGGCGCG GACTGGGTCG GCTACCTCAA CGTGGCCTTC
GTGTGGCTGG GCGTGCACCA GCTGGGGTTC CGCTACGCGA CGGGGACGAT CCGCCTGCGC
TACGCCGCGG GGATGGTGGC GGGCGGCGCG GCCGCCGCGC TGGCCCTGAC GACGTTCGGC
CCCTACTCGC TCAACATGAC GGGCGTGTTC GCCACGGAGT CCTCCAACGT GTCCCCGCCC
ACCCTGGTGC TCGCCGCCAT GGGCGCGCTG CAGATCGGCG TCGCGGTGCT GCTGCGCGAG
CGGATCAGCG CCTGGTCCGA GCGCCCGGGC CCGGCGCGTC TGCTGGACCG GATCTGCCCG
CAGCTGATGA CGGTCTACCT GTGGCACATG CTGCCGCTCA GCGTGGTGGC GGGCGTGCTG
GTGTTCGGCC TGGGGATCGA CACCCCCGAG CCGCTGACGG GCCTGTGGGT GTTGTGGGGC
GTGCTGGGGC TGGTGGTCCT GGTGCCCCTG ATCGTGCCGC TGGCGCACTG GGCGGTGCGG
TTCGAGAACC CGCCGAAGGT GCTGAGCGGT TCCCCGGGCA TGGTCCGCGT CCTGGCCGCC
GCGGCGCTGG TCGGCGGCGG GATGCTGCTG CTGACGGTGT CCGGCCTGGG GCTCGGCATG
GGACCGGTGC TCGGACTGCT CGCGGTGCTG TCGGGCGTGG TGCTGACTCG GGCTCCCCGG
AGGAGCTGA
 
Protein sequence
MTVTVENEGG GTSSAYASST VLLPSPRTPR PGAGRDRFLD VIRLFVMALV VMQHWWLPVL 
VHEPGSLEAG SVLSTEGGFV LTWVVQVMPL IFFVGGAANL ISWRSASGRG MSASDWFARR
LRRLAWPVVP LAALWIVASH LLVLGGAPAQ AVLVGAEAAG MVLWFLAVYV LVVVSTPLLF
RAQELFGWWV PIALLAAAAA VDLTRFSTGA DWVGYLNVAF VWLGVHQLGF RYATGTIRLR
YAAGMVAGGA AAALALTTFG PYSLNMTGVF ATESSNVSPP TLVLAAMGAL QIGVAVLLRE
RISAWSERPG PARLLDRICP QLMTVYLWHM LPLSVVAGVL VFGLGIDTPE PLTGLWVLWG
VLGLVVLVPL IVPLAHWAVR FENPPKVLSG SPGMVRVLAA AALVGGGMLL LTVSGLGLGM
GPVLGLLAVL SGVVLTRAPR RS
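
The protein sequence and the GC content reported in this record can in principle be recomputed from the gene sequence. The sketch below shows one way to do that in plain Python. It assumes that the amino acid assignments of translation table 11 match the standard genetic code (table 11 differs in its set of permitted start codons, which does not matter here because this gene begins with ATG). The helper names `clean`, `gc_percent`, and `translate` are hypothetical, and only the first and last blocks of the gene sequence are used as examples.

```python
# Standard codon table in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 and last 9 nucleotides of the gene sequence above.
first_block = "ATGACCGTGA CCGTCGAGAA CGAAGGCGGA"
last_block = "AGGAGCTGA"
print(translate(first_block))  # MTVTVENEGG
print(translate(last_block))   # RS
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the listed protein sequence and GC content.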