Gene Ndas_0796 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0796 
Symbol 
ID9244641 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp977928 
End bp979178 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content69% 
IMG OID 
ProductUDP-glucuronosyl/UDP-glucosyltransferase 
Protein accessionYP_003678746 
Protein GI297559772 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.702052 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCGTTA TATTGTTCGC CCCCGAGACA TTCAATCTCG CGGAGACAAC CCGGTCCATC 
GAGGTGGCGA AACACCTCCG TGAAACATAT GAATGCGTCT TTTCCGGGTA TTCCGAAAGA
TACTCCGGCC TCATTGAAGA GGCCGGGTTC ACGTTTCACC GCCTGGCCCC CGCGCTGACC
GACGAGGACG CGGACCAGCT GATCCGCGTC GACCAGGGGA AGGCGGTCCG GCACCCCTTC
ACCGCCGCGA TGCTCCGCAC CCGCGTCGCC AGTGAACTCG CCCTCATCGG CACGCTCCGC
CCGGCGGCGG TCGTGATCGG CACGACGCTC AGCCAGTTCG TCTCCGCCCG TGCCGCCGGC
GTGCCGCTCG TCTACGTCAA GCCCTTCGCC TACAGCTGGC CCCACATCCT CCAGACGCGG
TCGCTGCCGC TCGCCGAAGG GGACGGCCCG CTCCCCCGGG CGGTCAACAC CGGCGCCGCC
GCGCTCCTGC GGGAGGCCGC CCGGGTGACC ACCTACAAGC CCGCGGCCTT CCGGGCAGTC
GCGCGCGAGC ACGGGGTCAG GCTGCCCGGT CGCACCATCC AGGCGCTCGA CGCCGACCTC
AACCTGATCA CCTCCCTCTC CTGCTACCTG CGCCCCTACC GGATGCCCGC GAACTACCGC
CTGGTCGGCC CGGTCTTCGC CCGGATCGAC CGGGAGATCC CTCCGGACGT GGTCCGCGTC
GCCGAGGCCT CGGCGCGGGC GAAGCGCCCG GTGGTCTACT TCGCCATGGG CAGTTCCGGG
AACCGGGAAG TGGTGCTCCG GGTCCTCACC GAGCTCTCCC GGATGCCCGT CACCGTCATC
GCCCCCGTCG CCTCCTACCT GGAGGAGAGC GACCTCCCCC AGGTCGCGGA CAACATCCAC
GTGCGCGACC TCCTCCCCGC ACACCTGCTC GGCGACCTCA TCGACGCCTC CGTGATCCAC
GGGGGCGAGG GGACCGTGCA GACCGCGGTC ACGACCGGAA AACCCTTCGT CGGAATCGGC
CTGCAGATGG AGCAGCGGTG GAACGTGGCG GACTGCGTCC GCTTCGGAAA CGCCGTCGCC
GTCTCCCCCA AGGACGTTTC CGGAGCCTCT TTCCGCAATG CCGTGGAGAA GGTCCTCACG
GACCCCCGCA CCCGTTCCCG CGCACGCACC CTCCGCGAAC TCCTCTCCGG AGTCGACGGG
GCGGCGTCAG CAGCGGAACA CATTCACGAA CACGTGTCTC AGAAGCCGTG A
 
Protein sequence
MRVILFAPET FNLAETTRSI EVAKHLRETY ECVFSGYSER YSGLIEEAGF TFHRLAPALT 
DEDADQLIRV DQGKAVRHPF TAAMLRTRVA SELALIGTLR PAAVVIGTTL SQFVSARAAG
VPLVYVKPFA YSWPHILQTR SLPLAEGDGP LPRAVNTGAA ALLREAARVT TYKPAAFRAV
AREHGVRLPG RTIQALDADL NLITSLSCYL RPYRMPANYR LVGPVFARID REIPPDVVRV
AEASARAKRP VVYFAMGSSG NREVVLRVLT ELSRMPVTVI APVASYLEES DLPQVADNIH
VRDLLPAHLL GDLIDASVIH GGEGTVQTAV TTGKPFVGIG LQMEQRWNVA DCVRFGNAVA
VSPKDVSGAS FRNAVEKVLT DPRTRSRART LRELLSGVDG AASAAEHIHE HVSQKP