Gene Ndas_4956 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4956 
Symbol 
ID9248844 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp99633 
End bp101156 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content73% 
IMG OID 
ProductTAP domain protein 
Protein accessionYP_003682844 
Protein GI297563871 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.562249 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.30314 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCGCCG GGTCCGTGCT GCTGGCGACG GCGTGCACGG CGCAGGGCGA CCAGCCGAGG 
GAGGGAGGCG TCGCGCCCGG GCTCTCCGGG GACCTGGCGG CCTTCGCCGA CCAGGAACTG
GCCTGGGGCG AGTGCGAGAG CGGAGCGCCC GGCACCGAGT GCGCCACCTA CGAGGTGCCC
CTCGACTACG GGGACCCGGA CGGCGAGCGC ATCGAGATCG CGGTCAAGCG CTTCCCCTCT
GAGGGAGGCG ACGTCCTGGG CTCCCTGCTC GTCAACCCCG GCGGGCCCGG CGGCTCCGGG
TACGACTTCG TGGACCACGC CCCCTACACG GTCAGCGACG CGGTGCGCGA GAGGTTCGAC
GTGGTCGGGT TCGACCCGCG CGGTGTCGGT CGCAGCTCAC CGCTGACCTG CCTGGACGCC
GAGGGCATCG ACGAGTTCCT CGGCGGGGTG GACAGCGTCG AGGGCGACGG CGACATGTCC
GAGGTCTCCG CGGCCGAGCT GGCCGAGCTG GAGGAGGACA GCCGCGGCTT CGTCGAGGCC
TGCCAGGCCA ACCACCCCGA GCTGATGCGG CACGTGGGCA CCGCGGACGT GGCCCGCGAC
ATGGATCTGC TGCGCGCCCT GCTCGGCGAC GAGAAGCTCA CCTACCTGGG CGCCTCCTAC
GGCACCAGCA TCGGCGCCCA CTACGCCGAG CAGTTCCCCG ACCGCGTCCG CGCGCTGGTG
CTCGACGGCG CAGTGGACCC CAGCCAGGGG CAGCTCGACC TCAGCGTGCA GCAGGCGACC
GGGTTCGAGA CCGCCCTGCG GGCCTTCGTG GAGGACTGCC TGAGCCGGTC GGACTGCCCG
CTCGGCGCCC CCGGCGACAG CGTGGACGAC GGCATCGGGG CGCTGACCGC CTTCCTAGCC
GACACCGCCG AGAACCCCCT GTCCAACAGC ATGGACGACC GCGAGGTCAA CCGCGCCCGC
GCCGAACTGG GCGTGCTCGC CGCGCTCTAC ACCGAGGACT GGTGGCCGCG CGTGCGCGAG
GCCCTCACCG CCGGTACGGA GGGCGACGGC ACCCTTCTGC TCCAGCTCGC CGACGACCTC
TACAGCCGGA GCGACACGGA CGCCTACGTC AACTCCACGG CCGCGCTCAT CGCGGTGAAC
TGCTCCGACT CGCCCAGCCC GCGCGACGTG GAGGCCTACA CCGAGGCCGC GGCCCGGGCC
GGTGAGGAGT CACCGATCTT CGGCCCCAGC CTGGCGTGGG GCGCCCTGCC CTGCGCGTAC
TGGCCGGAGG AGGCGGTCGA CCCGCCCGTG GAGCTGGACG GGGACGGAGC CGCGCCCGTC
ATGGTGCTGG GCACCACCCG GGACTCGGCC ACCCCGTACG CGTGGTCCGA GGCGCTCGCG
GAGCAGCTCG ACTCGGGTTT CCTGGTGACC CGCGACGGCG ACGGACACAC CGGTTACCGG
ATGGGCGACC AGTGCGTCGA CGCGATGGTG GACGCCTACC TGGTCGACCT CACCGTGCCC
GAGGACGGCA TGGCCTGCGC CTGA
 
Protein sequence
MLAGSVLLAT ACTAQGDQPR EGGVAPGLSG DLAAFADQEL AWGECESGAP GTECATYEVP 
LDYGDPDGER IEIAVKRFPS EGGDVLGSLL VNPGGPGGSG YDFVDHAPYT VSDAVRERFD
VVGFDPRGVG RSSPLTCLDA EGIDEFLGGV DSVEGDGDMS EVSAAELAEL EEDSRGFVEA
CQANHPELMR HVGTADVARD MDLLRALLGD EKLTYLGASY GTSIGAHYAE QFPDRVRALV
LDGAVDPSQG QLDLSVQQAT GFETALRAFV EDCLSRSDCP LGAPGDSVDD GIGALTAFLA
DTAENPLSNS MDDREVNRAR AELGVLAALY TEDWWPRVRE ALTAGTEGDG TLLLQLADDL
YSRSDTDAYV NSTAALIAVN CSDSPSPRDV EAYTEAAARA GEESPIFGPS LAWGALPCAY
WPEEAVDPPV ELDGDGAAPV MVLGTTRDSA TPYAWSEALA EQLDSGFLVT RDGDGHTGYR
MGDQCVDAMV DAYLVDLTVP EDGMACA