Gene Ndas_4854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4854 
Symbol 
ID9248740 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5754796 
End bp5756436 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content75% 
IMG OID 
Productcoagulation factor 5/8 type domain protein 
Protein accessionYP_003682743 
Protein GI297563769 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.26673 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCT TCCTCATCGA ACCCGGTGCG AAGCTGGCCA ACCGCTACCG CCTAGACCAA 
GTGGTGAGCG AGACCGGCGG CGCCACACGC TGGAAGGCCA CCGACGAGAC GCTCGCGCGT
CCCGTGGCGG TGTGGACCTT CGCCGAGGGC TTCCCGCGCA CCTCCGAGGT CGTGCGCGCC
GCCCGGGCCA CGAGCCGCAT TCCCGACGCC CGCGTCACGC AGGTCTTCGA CGCCGACGAC
TCCTCCCCCG TCCCCTACGT GGTCGAGGAG TGGGTCATCG GCTCCTCCCT GGCCGACCTC
CTCGCGCAGG GCCCCATGGA GCCCGAGCGC GCCGCCGGGC TGGTCGCCGA GGCCGCCGAG
GCGATCGCCG CCGCGCACGC CGGAGGCCTT CACCACCTGT GCCTGACCCC GAGCAAGCTC
ATGTGGAGCA GCGGCGGCGC GGTCAAGGTG ACCGGCATCG GTGTGGACGC CGCCCTGCTG
GGCGCCGGCA ACCCCGACCC CGCGGCCACC GACGCCCAGG GGCTGGGAAA CCTGCTCTAC
GCGGCGCTCA CCGGGCACTG GCCCGGCGGC CCGCAGAGCG GCCTGCCCGC CGCACCCGAG
GGTCCCGCGG GCCCCTACCC TCCCCACCAC ATCCGCCAGG GCGTCACCGA ACCCCTGGGC
ACCATCACCA CGCGCGCCGC CCTGCCGCAG CTCGCGGGGC AGCTGGTGCC CGGACCGCCC
ATCGCCTCGC CGGCCGACTT CTGCGCGGCC ATGGCCGAGG TGCCCCGGCT CATCCCCCTG
CCCGTCACCC AGGCCGAGTC GGCCCCGCCC GTGCCCGGGA CCTCCCGGCG CACCGGCGAG
TTCGACAGCA CCGGCCCGCG CCGCGGCCCT CGCGGTACCC GTGGCGGGGC CGCGGGCGCA
CGGGACGACC AGGAGGCGCG CCACGGTTCC GTCGGACGGA CGGGATCCTC CTCGCGGGGC
GGCGGCTCCG TCCGGGGCGG CTCCGAGGTG CGTGAGCGGC AGGGCCCCTC CCGGATGGAG
CAGCGGACCG CGCGCACGCA GCCCTCGCTC CGCAAGATCC TGATCGGCGT GGCGGCCCTG
GTGCTGTTCG CCGGTGTGGT CGTCGGCGCC TGGACGGTCG GGACGATGTT CAGCGCGGGC
GGAGGCGAGG AGGCCCCGCC CGAGGGCGGC GGCGGCCAGG CCGCGGACGG CGGCGGGGAG
GTGGAGCTGA GCCCGCTGGA GATCCAGGGG TCACGCGGCC TCAACCCGCA CGGCAACACC
GACGAGCACT CGGACAAGGC GGGCCGCGCC CACGACGGGG ACACCGCCAC CGAGTGGAAC
ACCCAGGGCT ACAGGGACCC GCTCAGCGAC ATCAAGCCCG GCGTCGGACT CCAGCTCGAC
CTCGGCGCCG TCCACGAGGT CCACGAGGTG GACCTCAACC TCGGCGGCAG CGGCTACGAG
TTCCAGATCC TGGCCGGCGA GAGCGACTCG GACAGCGAGA CCGGTTACGA GGTCGTCGGA
TCGGGCACGG GCGGCTCCCA GGTCGTCACC CTCGACGAAC CGGTCGAGGC CCGCTACGTC
GTGGTCTGGT TCACCGAGCT GGCCGGGTCC GGCGAGTGGA GGGGCACGGT CTACGAGGCC
GAGGTACGAG GGGTCGAGTA G
 
Protein sequence
MSTFLIEPGA KLANRYRLDQ VVSETGGATR WKATDETLAR PVAVWTFAEG FPRTSEVVRA 
ARATSRIPDA RVTQVFDADD SSPVPYVVEE WVIGSSLADL LAQGPMEPER AAGLVAEAAE
AIAAAHAGGL HHLCLTPSKL MWSSGGAVKV TGIGVDAALL GAGNPDPAAT DAQGLGNLLY
AALTGHWPGG PQSGLPAAPE GPAGPYPPHH IRQGVTEPLG TITTRAALPQ LAGQLVPGPP
IASPADFCAA MAEVPRLIPL PVTQAESAPP VPGTSRRTGE FDSTGPRRGP RGTRGGAAGA
RDDQEARHGS VGRTGSSSRG GGSVRGGSEV RERQGPSRME QRTARTQPSL RKILIGVAAL
VLFAGVVVGA WTVGTMFSAG GGEEAPPEGG GGQAADGGGE VELSPLEIQG SRGLNPHGNT
DEHSDKAGRA HDGDTATEWN TQGYRDPLSD IKPGVGLQLD LGAVHEVHEV DLNLGGSGYE
FQILAGESDS DSETGYEVVG SGTGGSQVVT LDEPVEARYV VVWFTELAGS GEWRGTVYEA
EVRGVE