Gene Ndas_3851 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3851 
Symbol 
ID9247722 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4622261 
End bp4623988 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content75% 
IMG OID 
Productglycosyl transferase group 1 
Protein accessionYP_003681754 
Protein GI297562780 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0448869 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.314973 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCCTGGC GACACCTGCG GGCCGAACCG GCCCGGCTGC CGCTGCTGGC GCTGCGGCTG 
ACGCCCGGTC CGCCCCGGCG CGCGGTGCGC GCCCTGGCCT CCCGGGCGGG CGGCCGGGCC
CGCGCCTACG CCCTGTGGGA CCAGGGCCGG CGCGCGGACG CCCGCGAGGC GGTGCTCGCC
GCGGCGAGCG GTGCCTCCCC CCGCCGGGTG GGCCGCCTGG TGGCGTCCTG CCTGGCGGCG
GGAGACGCGG CCACCGCCCG GGAGCTGGTC GAACAGCTTC CGGAGGGCGC TCTTCGGGAG
GCCGCGGAGC ATCGGACGGC CCTGGCCACG GGACGCGCTG TCGCCGCGCC CCCGTCCACG
TCGCCTGACC GTCACGCCAT CACGTTAACC CGCGATCCAC GAACGCCGCC CGACGCCACG
GTGAACGCTC GGCGACCGGA GGCGTCGGAG GGTGAACGGT GGCGTGTGGC CGGGGACCCG
GCGTCACCCG GAGCGGGCCT GCGGGTGCTG CACCTGGTGA CCAACGCGCT GCCGCACACC
AACGCGGGCT ACACCCAGCG CACCCACAAG ATCGCGGTCG CCCAGCGCGA GGCCGGAATG
GACGTGCACG TGGTCACCCG GGCGGGGTAC CCCCTGGTCA AGGGGGTTCC CGACCCGCGC
ACGCTGGTGC GGGTGGACGG CATCCCCTAC CACCGCCTCC TCCCCTGGAC GGCGCCCGCC
GACGCCGCCC AGGAGCTGGC CGCGGGGGTG CGGCTGGGGT CGGAGCTGGT GGAGGCGCTG
CGGCCCGACG TGCTGCACGC CGCGAGCAAC CACCACAACG CCCGCCTGGC CCTGGAGCTG
GGCCGCAGGT TCGGCCTGCC GGTGGTGTAC GAGGTGCGGG GCTTCCTGGA GGAGTCGTGG
CTCTCGCGCG ACCCCTCGCG CAGCGTGGAC GACGCCTTCT ACCGGGCCGA GCGCGCGAGC
GAGACCGAGT GCATGCTGGC CGCCGACCTT GTGGTGACCC TGGGCGAGGC GATGCGCGCC
GACATCGAGG CGCGCGGCGT GCCGCGCGAG CGCCTGCTGG TGGTGCCCAA CGCGGTGGAC
GCCTCCTTCC TGGCCCCGCT GCCGCCGGGC GCGGGGGTGC GCGCCGAGCT GGGGATCGGC
GGCGAGGACT TCGTGGTGGG CACCACCACC AGCTGCTTCG GCTACGAGGG CCTGGACACG
CTGCTGGAGG CGGTGGCCCT GATGCGCGAA CGCGGCGAGG CGGCGCACGC CCTGGTGGTG
GGGGACGGCC CCGAGCTGCC CGCGCTGCGC TCCCTGGCCG ACTCGCTGGG TCTGGAGGGG
GCCGCGCACT TCACCGGCCG CGTCCCGGCC GCGCGGGTGC GCGACCACCA CGCCGCGCTG
GACGTGTTCG CGGTGCCCAG GCGCGACGAG CGGGTGTGCC GTCTGGTCAC CCCGCTCAAA
CCCGTGGAGG CCATGGCGGG CGGGCTTCCG GTGGTGGCCA GTGATCTCCC CGCGTTGCGA
GAGATCGTGG AACCGGGAGT GACAGGAGAG TTAATTCCGG CAGGCGAATC GGCGACCCTA
GCCGATGTGC TGACAAAACT CGCTTACAGT CGTGAAAAGC GGATCTCCTA CGGCAGTGCG
GGTCGCGATC TCGTCGGCGA CCGCACCTGG GCCGAGGCCG CATACCGCTA CAACCAGGCG
TATCGGGTTC AGATTCGCGA AATGACCGAA CCAGGCCAGA ACCCGTAA
 
Protein sequence
MAWRHLRAEP ARLPLLALRL TPGPPRRAVR ALASRAGGRA RAYALWDQGR RADAREAVLA 
AASGASPRRV GRLVASCLAA GDAATARELV EQLPEGALRE AAEHRTALAT GRAVAAPPST
SPDRHAITLT RDPRTPPDAT VNARRPEASE GERWRVAGDP ASPGAGLRVL HLVTNALPHT
NAGYTQRTHK IAVAQREAGM DVHVVTRAGY PLVKGVPDPR TLVRVDGIPY HRLLPWTAPA
DAAQELAAGV RLGSELVEAL RPDVLHAASN HHNARLALEL GRRFGLPVVY EVRGFLEESW
LSRDPSRSVD DAFYRAERAS ETECMLAADL VVTLGEAMRA DIEARGVPRE RLLVVPNAVD
ASFLAPLPPG AGVRAELGIG GEDFVVGTTT SCFGYEGLDT LLEAVALMRE RGEAAHALVV
GDGPELPALR SLADSLGLEG AAHFTGRVPA ARVRDHHAAL DVFAVPRRDE RVCRLVTPLK
PVEAMAGGLP VVASDLPALR EIVEPGVTGE LIPAGESATL ADVLTKLAYS REKRISYGSA
GRDLVGDRTW AEAAYRYNQA YRVQIREMTE PGQNP