Gene Ndas_2271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2271 
Symbol 
ID9246121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2716961 
End bp2718628 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content77% 
IMG OID 
Productglycosyl transferase family 2 
Protein accessionYP_003680199 
Protein GI297561225 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.464934 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.787256 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAACGCG TCCTCACCCG CGTCTTCCGC AACGACTGGA GTGCGCTGAC ACCGCCCGAC 
ATCGGACGCT GGACCCCCGA CCTGCGGGTC AGCGTCGTCA TCCCGGCGCG CGGCGGGCAG
CGCCGCCTCG ACCTCGCCCT GGCCTCCCTG GCCGCGCAGA CCTACCCCGA GGACCTGATG
GAGGTCGTGG TCGTGGACGA CCACTCCTCC CCGGCGCTGC GCCTGCCCGA CCTGCGCCCC
GCGCACTGCC GCGTCCTGAC CGTCCCCGAC GGCGGCTGGG GCGCCGGGTA CGCCCGCGCC
TACGGTGCCC ACTCCAGCAC CGGCGACGTC CTGCTGTGGA TGGACGCCGA CATGGTGGTG
TGCCGCGAGT TCGTCGAGGC CCAGGCCCGC TGGCACCACG TGCACGCCGA GGCCGTCACG
CTCGGCCGGG TCCGCTTCGC CGACACCGGG CCCCAGAGCC CCACCGACGT CCTGGCCCTG
GCCCGCACCG ACGCCCTGCA CGGCGCCCTG GACACCGGCC GCCACCACGC GTGGGTCGAG
CGCGTCCTCA CCGGGAGCGA CGGCCTGCGC GACGCCGACC ACCTGGGCTT CCACGCCTAC
GTCGGCGCGG CGGCGGCCGT GCGCCGCAGC CTGTACGAGG CCGCCGGGGG AGTGGACCCC
GACCTCGACC TGGGCCAGGA CACCGAGTTC GGCTACCGCC TCTGGCAGGC GGGCGCCGTC
CTGCTGCCCG AACCCGCCGC CACCGGCTGG CACGTGGGCC GCGCGGGCAC CGCGCGCACC
CGGCTGCCCT CCGAACGCTT CCGCACCGAG GTCCTCGCCG AGTTCATGCC GCACCCGCAC
GCCTACCGCG AGCGGGTACC GGCCCACCGG CGCCGCATCC CGCTCGTGCA CGCCGTGGTC
GAGGTCTCCG GCGCGCCCTA CGACCTGGTC CGCGGCTGTG TGGACCGGCT CCTGGACAGC
GCCGAGACCG ATCTGGCCCT GACCCTGGTC GCCGACTGGG AGGGCGCCGA GGAGGGAGGG
GGAGCGCGCG GGGCGCGGCG GCTGCGGGCG GTGGACGGCC GGGCCCGGCG CGACGTCGGC
GGACCCCACC TGGACCTGCG GCTGATCCAG GCCAACTACC TGCGCGAACC CCGGATCTCC
TTCGCCACGT CCGCGCCCCG CACCGGTTTC CCCTCACCCT TCCTGCTCCA GGTCCCGGTC
TCCTGGGGCC TGGGCCAGGT GGCCCTGTCG CGCCTGCTGG CCAGCGCCGA GCGCGCCCGG
GCGGGACTCA CCGAACTCTT CCCGGCCGCC TCGCCCACCC GCGACGCCGG GGTGAGGCTG
TGGCGCACCC GGGCGCTGGC CCGGGCCCTG CGGGTGCGCG AGGAGGACGA GGACCTGGGC
GACGTGGTCG CCGCGCTGCA CGGCCGCTAC CGGATCCACG CCGGGGAGGA GACGCTGACC
GACCTGTCGC TGTACCGCTC GGTGCCGCCG CCCCCGCGCA CCGAACCCGA ACCGGCGGAG
TTGGCCGCAC CGTCCCCGCG GGCCGGGACC GAGGAGCGCC CGGCGGGCGA GTGCGGGACG
TGGGAGTGCG GGACGGGGGA GGAGCGCACC GGCGCCGCGC GGTCCGGAGG GTGGCTGCGC
TCGGGATGGG CGCGGGCCCG CCAGCGGCTG CGCCGCGAGC GCGGCTGA
 
Protein sequence
MERVLTRVFR NDWSALTPPD IGRWTPDLRV SVVIPARGGQ RRLDLALASL AAQTYPEDLM 
EVVVVDDHSS PALRLPDLRP AHCRVLTVPD GGWGAGYARA YGAHSSTGDV LLWMDADMVV
CREFVEAQAR WHHVHAEAVT LGRVRFADTG PQSPTDVLAL ARTDALHGAL DTGRHHAWVE
RVLTGSDGLR DADHLGFHAY VGAAAAVRRS LYEAAGGVDP DLDLGQDTEF GYRLWQAGAV
LLPEPAATGW HVGRAGTART RLPSERFRTE VLAEFMPHPH AYRERVPAHR RRIPLVHAVV
EVSGAPYDLV RGCVDRLLDS AETDLALTLV ADWEGAEEGG GARGARRLRA VDGRARRDVG
GPHLDLRLIQ ANYLREPRIS FATSAPRTGF PSPFLLQVPV SWGLGQVALS RLLASAERAR
AGLTELFPAA SPTRDAGVRL WRTRALARAL RVREEDEDLG DVVAALHGRY RIHAGEETLT
DLSLYRSVPP PPRTEPEPAE LAAPSPRAGT EERPAGECGT WECGTGEERT GAARSGGWLR
SGWARARQRL RRERG