Gene Ndas_0391 details


Gene Information

Locus tag: Ndas_0391
Symbol:
ID: 9244229
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 481257
End bp: 482630
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: glycosyl transferase group 1
Protein accession: YP_003678345
Protein GI: 297559371
COG category:
COG ID:
TIGRFAM ID:
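
The start and end coordinates, the gene length, and the protein length in this record agree with each other. The Python sketch below shows the arithmetic. It assumes inclusive, 1-based coordinates and a CDS that ends in a single stop codon not counted in the protein length; neither assumption is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(481257, 482630)
print(length_bp)                  # 1374
print(protein_length(length_bp))  # 457
```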


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.380998
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACCCAGT CGGGCACCAC ACCGAGCCCA CGGCCCCGTG CCGAGCGCCC CCTGCGCGTC 
GCCCTGCTGT CCTACCGCAG CAAGCAGCAC GTCGGCGGAC AGGGCGTGTA CGTCCGCCAC
CTCTCGCGCG AACTGGCCGC CCTGGGCCAC GAGGTCACCG TCCTCTCCGG CCAGCCCTAC
CCCGTCCTGG ACGAGGGCGT GACCCTGGAG AAGGTCCCCT CCCTGGACCT GTACAACGAC
GCCCACCCCT TCAAGGCCCC GCCCGTACGC GAGTGGCGCG ACTGGATCGA CGCCCTGGAG
GTCGCCACCA TGTGGACCGC CGGGTTCCCC GAGCCCCTCA CCTTCTCCCT GCGCGCCAAC
CGCGAACTGC GCCGCCGCCT GGACGACTTC GACGTCGTCC ACGACAACCA GACACTGGGC
TGGGGACTGC TCGGCATCAG GTCCGCCGGG CTGCCGCTGG TCACCACCAT CCACCACCCC
ATCAGCGTGG ACCGCAGGAT CGAACTGGCC GAGGCCCGGG GCCTGCACAG GCTCACCAAG
CGCCGCTGGT ACGGGTTCGT GGGCATGCAG GCCAGGGTCG CCCGCCGGCT CGACCCGATC
CTGGTGCCCT CCCAGTCCTC CGCCGACGAC ATCGCCCGCG AGTTCGGCGT CGCCCCCTCC
GCCATGGAGG TCACCCCGCT GGGCGTGGAC ACCCGCCACT TCCACCCGCG CCCCGCACTG
GAGCGCGTAC CGGGACGCAT CGTGTGCACC GCCAGCGCCG ACAGCCCCCT CAAGGGTGTG
GCGGTCCTGC TGCGCGCCGC GGCCAAACTC GCCACCGAAC GCGACATCAC CCTGACCGTC
GTCAGCAGAC CCAAGCCCGG CGGCCCCACC GACCAGCTGG TGGACGAGCT GAGCCTGCGC
GACCGCGTCG AGTTCGTCAG CGGCATCGAC GACACCGCCC TGGCCGAACT CATCGCCAGC
GCGCAGGTGG CGGTCGTCCC GTCCTTCTAC GAGGGGTTCT CCCTGCCCGC GGTGGAGGCC
ATGGCCTGCG CCACGCCGCT GGTGGCCAGC CACGCCGGAG CGCTGCCCGA GGTCGTGGGC
ACCGACGGGG ACGCCGGGCG CCTGGTGCCC CCGGGCGACC CCGAGGTGCT CGCCGAGGCG
CTCGCCGCCC TGCTCGACGA CGACGCCGAA CGCGAGCGCA TGGGCGCCGC CGCCTGGCGC
CGGGTCCAGG AACGCTTCAC GTGGAGAGCC GTCGCCGAGC TGACCGCGCG CCGCTACGCC
TCCACCATCG ACGCCGTGAG GGGCAACCGC CCCGTCGACG GCGACACGCG CCCCGGGCCC
GCCGAGCCCG CGTCCCCGCC AGCACCCGAG CGGGCCCGCG CGAACGGCGC CTGA
 
Protein sequence
MTQSGTTPSP RPRAERPLRV ALLSYRSKQH VGGQGVYVRH LSRELAALGH EVTVLSGQPY 
PVLDEGVTLE KVPSLDLYND AHPFKAPPVR EWRDWIDALE VATMWTAGFP EPLTFSLRAN
RELRRRLDDF DVVHDNQTLG WGLLGIRSAG LPLVTTIHHP ISVDRRIELA EARGLHRLTK
RRWYGFVGMQ ARVARRLDPI LVPSQSSADD IAREFGVAPS AMEVTPLGVD TRHFHPRPAL
ERVPGRIVCT ASADSPLKGV AVLLRAAAKL ATERDITLTV VSRPKPGGPT DQLVDELSLR
DRVEFVSGID DTALAELIAS AQVAVVPSFY EGFSLPAVEA MACATPLVAS HAGALPEVVG
TDGDAGRLVP PGDPEVLAEA LAALLDDDAE RERMGAAAWR RVQERFTWRA VAELTARRYA
STIDAVRGNR PVDGDTRPGP AEPASPPAPE RARANGA
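
The gene sequence begins with TTG, while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal, and plant plastid code), TTG is one of the permitted alternative start codons and is read as methionine when it initiates translation. The Python sketch below illustrates this on the first 30 bases of the gene. The helper name `translate_cds` is hypothetical, and the code is a minimal illustration rather than the method the database used.

```python
# Codon table in NCBI order (bases T, C, A, G); the amino-acid
# assignments of table 11 are the same as those of the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Start codons listed for NCBI translation table 11.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a table 11 start codon in first position as M."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    usable = len(dna) - len(dna) % 3    # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":              # stop codon ends translation
            break
        if i == 0 and codon in START_CODONS_TABLE_11:
            residue = "M"               # initiator codon is read as methionine
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_cds("TTGACCCAGT CGGGCACCAC ACCGAGCCCA"))  # MTQSGTTPSP
```

The same function can be applied to the full 1374 bp sequence to compare its output with the listed protein sequence.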