Gene Ndas_4743 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4743 
Symbol 
ID9248625 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5626500 
End bp5627687 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content75% 
IMG OID 
ProductROK family protein 
Protein accessionYP_003682635 
Protein GI297563661 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAGAA CCCCCGGACG CCTGTCCTCT GCTGCCACCA GCGGCCACAT CCTGGAGCTC 
ATCCGCTCCG GAACCGCGAC CAGCCGCTCC GAGATCGGCC GGGTCACGGG CCTGTCCCGT
CCCTCCGTCG CCCTGCGGGT CACCGAGCTG ATCGGCGGCG GGCTCGTCAC CGAGGGCACC
GGCGCGGTCT CCAGCGGCGG ACGGCCGCCC ACCCTGCTGG AGTTCAACGC CGCCAGCGGC
CTCATCCTCA CCAGCGCCCT GGGCATGGTC CGCAGCCAGG CCGCCGTGTG CGACCTCGAC
GGCGAGGTCC TCGTGCGCAC GCCGGGCTCG CCCGCGATGG AACAGGGCCC CGACGCCACC
CTCCCCTGGC TGCTGGACAC CTGGTCGGAG CAGATCGCCT CGCTCGGCCG CGACCCCGGT
GACGTGCGCG GCGTGGGCAT CGGCCTGCCC GGCACCGTCG AGTTCCACGC GGGCCGCGCC
GACGACCGCC CCTTCCTCGG CAAGTGGGCG GGCGTGGCGC TGGCCCCGCT GGTCGCCGAG
CGCTTCCCGG TGCCGGTGAT GGTGGACAAC GACGTGAACG TGATGGCGCT CGGCGAGCAC
ATCGCGGGCG GGCACGGGCA CCCCGACGAC ATGGTGTTCG TGAAGGCCTC CACGGGGATC
GGCGCGGGTC TGCTGTCCGG CGGGCGGCTG CTGCGCGGTT CGCTGGGCGC CGCCGGGGAG
ATCGGGCACA TCCCGGTGCG CGGCGCGGGC GGGCTCCCGT GCCGCTGCGG CAACACCGAC
TGCCTGGAGG CGGTCGCGGG CGGGCGGCGG CTGCTGGAGA GCGCCGCCGA ACAGGGGTGC
CGGGCACGGA CGCTCAAGGA CCTGGTGGCG CTGGCCTCGG GGGGCGACCC GGTGGCCGTC
ACCCTGGTCC GGGAGGCGGG GCGCAGACTG GGTGAAGCGC TCGCGGGGGC GGTGAACCTG
CTCAACCCCG AGGTGATCGT GCTGGGCGGC GACCTGGCCG AGGCCTACGA CCACCTGGTG
GCGGGCGTGC GCGAGGTGGT GTTCCAGCAG TGCACCGCCC TGGCCACCCG TCAGCTGCGC
GTCGTGGCCA GCTCCCTGTG GGACGAGGCG GGCGTGCGCG GCTGCGCGGC CATGGTGACC
GAGGAGATCC TGTCGCCGGA GGCGGTGAAC AAGCTCCTGG CGGGTTGA
 
Protein sequence
MARTPGRLSS AATSGHILEL IRSGTATSRS EIGRVTGLSR PSVALRVTEL IGGGLVTEGT 
GAVSSGGRPP TLLEFNAASG LILTSALGMV RSQAAVCDLD GEVLVRTPGS PAMEQGPDAT
LPWLLDTWSE QIASLGRDPG DVRGVGIGLP GTVEFHAGRA DDRPFLGKWA GVALAPLVAE
RFPVPVMVDN DVNVMALGEH IAGGHGHPDD MVFVKASTGI GAGLLSGGRL LRGSLGAAGE
IGHIPVRGAG GLPCRCGNTD CLEAVAGGRR LLESAAEQGC RARTLKDLVA LASGGDPVAV
TLVREAGRRL GEALAGAVNL LNPEVIVLGG DLAEAYDHLV AGVREVVFQQ CTALATRQLR
VVASSLWDEA GVRGCAAMVT EEILSPEAVN KLLAG