Gene Ndas_3848 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3848 
Symbol 
ID9247719 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4616624 
End bp4618879 
Gene Length2256 bp 
Protein Length751 aa 
Translation table11 
GC content73% 
IMG OID 
Productglycosyl transferase group 1 
Protein accessionYP_003681751 
Protein GI297562777 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGTA CGGGCAACGA GAAAGCATCC GAAGGGAAGA ACGTGAGCAC ACCGCCTCCG 
CGCGCACTCC TGTACGGGGA CGTGGACCTC AACATCATCG ACGGTTCCGC GATCTGGGCG
CAGTCGATGG CGCAGGCCCT GGCCGCCGCC GGATGCGAGG TGACCCTGCT GCTGAAGGCG
CCCGTGCGCA CCGACCGGCT CACCGAGCCC CTCACCAGGG TGCCCGGGGT CCGGCTGCTG
CGGCCCTACG AGGACAAGGC GCTGCCCGAC CTGGGGCCGA GGGGGCTCAC CCCGGAGCAG
GCCGTCACCC TGATGACCAG GCACCACGAG CGCAAGCCCT TCGACCTCGT CGTCGTGCGC
GGGCGCCGCC TGGCCGGGCT GGCCGCGCAG GAGGAGGCGC TGGCGGGCCG CCTGTGGACC
TACCTGACCG ACTTCCCGCA CAGCGTGGGC GAGCTGTCCG CGACCGCCAC CGCCGAACTC
ACCGAGATCG CGCTGGCCTC GCGGTTCCTG CTCTGCCAGA CCGAGGAGCT GCGCGCCTTC
CTGGAGTCGA CGGTGCCCGC CGCCTGCGGC CGGTCGGTCC TGTTCCCGCC CGTGGTGGTC
GTCCCCGAGG ACGTGCGCGC CGACGGCGGG GCCGGCGGCC GCGCCCGCCT GGCCTACACC
GGCAAGTTCG CGCCCCGCTG GAACACCCTG GAGATGACCG AGCTGCCCGC CGAGCTGGCG
CGGCGCGGGG TGGACGCCGA ACTCGTGATG ATCGGCGACA AGATCCACGC CGAACCCGCC
GACTGGGCCA AGAACATGCG CAGGGCCCTG GAGGGCACCC CGAACGTGGA CTGGCGCGGC
GGGATGTCGC GCGCCGAGGC GCTCCGCCAG GCCGCCGAGT GCGGCTTCGG GCTGTCCTGG
CGCGACCCGT CCATGGACGC CAGCCTGGAG CTGTCCACCA AGGTGCTGGA GCTGGGCGCG
CTCGGACTGC CGGTGGTGCT CAACCGCACG CCCATGCACG AGGCCATGCT GGGCGCGGAC
TACCCGCTCT TCGCCGGGAC CGACGTCGCC TCGGTCGCCG ACGTGGTGGC CCGCGCCCAC
GGCGACCGCG CGGTCTACGC GGACGCCGCC GCGCGCTGCC GCGACGCCGC GGCCGACCAC
ACGCTGGAGC GGGCCGCCGA GCGGCTGCGC GGCTACCTCG CCGACGCCCT GCCGCCCACC
CCCGAGGGCG CCGACCCCGA GCGGCCGCTC AAGGTGGTCA TCGCCGGGCA CGACATGAAG
TTCTTCACCC GCCTGGCCGA GTACCTGGAC TCGCTGCCCG GTCTCGACGT GCGCATGGAC
GAGTGGGAGG GGCTGAGCAC CCACGACCAG TACCGCTCCC GGGAGCTGGC CGCCTGGGCC
GACGTGGTGA TCTGCGAGTG GTGCGGGCCC AACGCGCTGT TCTACTCCAA GTGGAAGCGC
CCCGACCAGC GGCTCATCGT GCGGCTGCAC CGCTTCGAGC TCTACGCGGA GTGGCCCCGC
AAGCTCGACA TCGACAAGGT CGACGCGGTG GTGTGCGTGA GCCCCCACTA CGCCGACCTG
ACCCGCGAGA TCACCGGGTG GCCCGCCGGG AAGGTGGTCG TGGTCCCCAA CTGGGTGGAC
GACGAGCAGC TCGGCCGCCC CAAGCTCCCC GGGGCCGAGT ACTCCCTGGG CATGGTCGGC
ATCGCGCCCT CGCGCAAGCG GCTGGACCGG GGCCTGGACG TCATCGCCGA GCTGCGGCGC
ATGGACCCGC GGTACACGCT GTCGGTCAAG ACCAAGCAGC CGTGGGAGTA CTGGTGGATC
TGGAACCGGC CGGAGGAGCG CGCCTACTTC GAGCGCGTCT ACCGGCGGAT CCAGCGCGAC
GAGCGCCTCG CCTCCGGGGT GGTGTTCGAC CCCTTCGGGC CGGACGTGGC CACCTGGCTG
CGGCGGGTGG GGTTCATGCT CTCCACCAGC GACGACGAGT CCTTCCACCT GGCGCCCGCC
GAGTGCGCCG CCTCGGGCGG CGTGCCCGCC CTGCTGCCGT GGCCGGGCGC GGACACCATC
TACGACCCGC ACTGGATCCA CGACGACGCC GTGGCGATGG CCGAGGCCAT CCACGCGACC
GTCAGCGAGG GGCGGTTCTC CTCGGAGGCC GCGCGCGCAC GCGAGGAGGT CACCACCGCC
TACGGCCTGT CCCGGGTGCG GTCGCTGTGG AGCGACCTGG TGGTCCGCGG CAGGGCGCCC
CAGGCGGAGC ACTCCGCCGC CACAGCAGGC GCCTGA
 
Protein sequence
MSRTGNEKAS EGKNVSTPPP RALLYGDVDL NIIDGSAIWA QSMAQALAAA GCEVTLLLKA 
PVRTDRLTEP LTRVPGVRLL RPYEDKALPD LGPRGLTPEQ AVTLMTRHHE RKPFDLVVVR
GRRLAGLAAQ EEALAGRLWT YLTDFPHSVG ELSATATAEL TEIALASRFL LCQTEELRAF
LESTVPAACG RSVLFPPVVV VPEDVRADGG AGGRARLAYT GKFAPRWNTL EMTELPAELA
RRGVDAELVM IGDKIHAEPA DWAKNMRRAL EGTPNVDWRG GMSRAEALRQ AAECGFGLSW
RDPSMDASLE LSTKVLELGA LGLPVVLNRT PMHEAMLGAD YPLFAGTDVA SVADVVARAH
GDRAVYADAA ARCRDAAADH TLERAAERLR GYLADALPPT PEGADPERPL KVVIAGHDMK
FFTRLAEYLD SLPGLDVRMD EWEGLSTHDQ YRSRELAAWA DVVICEWCGP NALFYSKWKR
PDQRLIVRLH RFELYAEWPR KLDIDKVDAV VCVSPHYADL TREITGWPAG KVVVVPNWVD
DEQLGRPKLP GAEYSLGMVG IAPSRKRLDR GLDVIAELRR MDPRYTLSVK TKQPWEYWWI
WNRPEERAYF ERVYRRIQRD ERLASGVVFD PFGPDVATWL RRVGFMLSTS DDESFHLAPA
ECAASGGVPA LLPWPGADTI YDPHWIHDDA VAMAEAIHAT VSEGRFSSEA ARAREEVTTA
YGLSRVRSLW SDLVVRGRAP QAEHSAATAG A