Gene Avin_11940 details


Gene Information

Locus tag: Avin_11940
Symbol: rluD
ID: 7760136
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 1145900
End bp: 1146910
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 67%
IMG OID: 643804095
Product: Pseudouridine synthase
Protein accession: YP_002798397
Protein GI: 226943324
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0564] Pseudouridylate synthases, 23S RNA-specific
TIGRFAM ID: [TIGR00005] pseudouridine synthase, RluA family
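
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon (the listed gene sequence does end in TGA). The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1145900
END_BP = 1146910
GENE_LENGTH_BP = 1011
PROTEIN_LENGTH_AA = 336


def span_length(start: int, end: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                # True
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Both checks pass for this record: 1146910 - 1145900 + 1 = 1011 bp, and 1011 / 3 = 337 codons, one of which is the stop codon, leaving 336 residues.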


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCGAAA CCAAAGGCCA TGCCCCGCCG TTGCCGAGCA TGTCAATACA CAACGATCAG 
GCTATCAGAC TTCGCGCCGA AGTCCCCGTC GAGCTGGGCG GCCAACGCCT GGACCAGGTC
GCCGCACAAC TCTTCGCCGA GCATTCGCGC TCGCGCCTGG CCGCCTGGAT CAAGGACGGG
CGACTGACCG TCGATGGCGC CGCGCTGCGC CCCCGCGACA TCGTGCATGG CGGCGCGCTG
CTCGAACTGG ATGCCGAGCA GCAGGCGCAG GGCGAGTGGA TCGCCCAGGC CATCGAACTG
GACATCGTCC ACGAGGACGA GCAGATCCTT GTGCTGAACA AGCCGGCCGG GCTGGTCGTG
CATCCGGCTG CCGGGCATGC CGACGGCACC CTGCTCAATG CGTTGCTGCA CCATGTGCCG
GGGCTGGTCA ATGTGCCGCG CGCGGGCATC GTCCACCGTC TGGACAAGGA CACGACCGGC
CTGATGGTGG TGGCCAAGAC CCTGCAGGCG CAGACCCGGC TGGTCGAGCA ACTGCAAAAG
CGCAGTGTCA GCCGCATCTA CGAGGCCATA GTGGTCGGTG TGATCACCGC CGGCGCCACC
ATCGATGCGC CCATCGGCCG GCATGGCCAG CAGCGCCAGC GCATGGCGGT GGTCGAGGGC
GGCAAGCCCG CGGTCAGCCA CTACCGTGTG CTCGAACGCT TCCGTGCGCA TACCCATGCC
CGTATCAAGC TGGAAACCGG GCGAACCCAC CAGATCCGCG TGCACATGGC GCACGTCGGC
TATTCGCTGG TGGGCGATCC GGTCTATGGC GGGCGTTTCC GCATTCCGCC GGCGGCCAGT
CCGACCCTGG TGCAGGCGCT TCGGGAGTTT CCGCGCCAGG CCCTGCACGC CCGCTTCCTC
GAACTGGATC ACCCGGCCAG CGGCGAGCGC CTGAAATGGG AGGCGCCGCT GCCGGACGAT
TTCGTCTGGC TGTTGACACT GCTGCGCCAG GACAACGAGG CGTTCGTCTG A
 
Protein sequence
MVETKGHAPP LPSMSIHNDQ AIRLRAEVPV ELGGQRLDQV AAQLFAEHSR SRLAAWIKDG 
RLTVDGAALR PRDIVHGGAL LELDAEQQAQ GEWIAQAIEL DIVHEDEQIL VLNKPAGLVV
HPAAGHADGT LLNALLHHVP GLVNVPRAGI VHRLDKDTTG LMVVAKTLQA QTRLVEQLQK
RSVSRIYEAI VVGVITAGAT IDAPIGRHGQ QRQRMAVVEG GKPAVSHYRV LERFRAHTHA
RIKLETGRTH QIRVHMAHVG YSLVGDPVYG GRFRIPPAAS PTLVQALREF PRQALHARFL
ELDHPASGER LKWEAPLPDD FVWLLTLLRQ DNEAFV
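
The gene and protein sequences above are printed in space-separated blocks of ten, so they need cleaning before any analysis. The following is a minimal sketch, using only the Python standard library, of how one might strip the spacing, compute GC content, and translate the coding sequence. It assumes the sequence is given 5' to 3' on the coding strand and uses the standard amino-acid assignments, which translation table 11 shares with the standard code for sense codons (the tables differ in permitted start codons). The helper names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Trailing bases that do not form a full codon are ignored;
    codons with unexpected characters are rendered as 'X'.
    """
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    first_line = ("ATGGTCGAAA CCAAAGGCCA TGCCCCGCCG "
                  "TTGCCGAGCA TGTCAATACA CAACGATCAG")
    print(translate(first_line))  # MVETKGHAPPLPSMSIHNDQ
```

Translating the first line of the gene sequence gives the first twenty residues of the listed protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the 67% reported in the record.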