Gene Avin_38720 details

Gene Information

Locus tag: Avin_38720
Symbol: truD
ID: 7762761
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 3916494
End bp: 3917552
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 69%
IMG OID: 643806735
Product: tRNA pseudouridine synthase D
Protein accession: YP_002800987
Protein GI: 226945914
COG category: [S] Function unknown
COG ID: [COG0585] Uncharacterized conserved protein
TIGRFAM ID: [TIGR00094] tRNA pseudouridine synthase, TruD family
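
The gene and protein lengths above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon (the function names are illustrative and not part of the record):

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(3916494, 3917552)
print(length, protein_length(length))  # prints: 1059 352
```

Both values agree with the Gene Length and Protein Length fields listed above.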


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.711816
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGAGT CCGAACTGCT CGGTCCCAGG GCATATGGCG AAGCCTGCGG CCAGGCACTG 
CTCAAGGCCT GCGCCGAGGA CTTCCAGGTC GACGAGGTGC TGGACATTCC CCTGAGTGGG
CAGGGCGAGC ATCTCTGGTT GTGGGTGGAA AAGCGCGGGC TGAATACCGA GGAGGCGGCT
CGGCGCCTGG CCCGTGCCGC CGGGATGCCG CTGAAGGCGA TCAGCTATGC CGGTCTGAAG
GATCGCCAGG CGCTGACCCG CCAGTGGTTC AGCCTGCACC TGCCGGGCAA GGCCGATCCC
GACCTCGTGG CCGCGGAGGA CGACAGCCTG CGTATCCTCG AGCGTGTCCG CCACTCGCGG
AAACTGCAAC GTGGTGCCCA CGCGGCCAAC GGTTTCAAGC TGCGGCTGAC TCGCCTCGCC
GCCGATCGTC CGGCGCTGGA TGCGCGTCTG GAGCAGCTCC GCCGCCAAGG AGTGCCCAAC
TACTTCGGCC TGCAGCGTTT CGGCCACGAC GGCGGCAATC TCGCCGAGGC CAGGGCCTTC
GCCGTGCGGC GGGAACTGCC TGCGCAACGC AACCTGCGCT CGCGTCTGCT CTCGGCAGCG
CGCAGCTATC TGTTCAACCG GGTGCTGGCC GAGCGGGTCG CGGTAGGCGA CTGGAATCGG
GCGCAGCCGG GAGACCTGCT GGCTTTCACC GACAGCCGCA GTTTCTTCCC GGCGGGCGTG
GAGGAGTGCG CCGACCCGCG CCTGGCACTG CTCGACCTGC ATCCCACCGG CCCGCTCTGG
GGCGCGGGCG GCTCCCCGGC CGGCGCGGCG ACCAAGGTGC TGGAGGATGC CGTCGGCCGG
TGCGAGGCGC CACTCGGCGA CTGGCTGGGG GAAGCGGGCA TGCTGCACGA ACGGCGCATC
CTGCGCCTCC CCATCGACCG GCTGGCGTGG CATTATCCCG CCATCGACAT CTTGCAACTG
GAATTCGTCC TGCCGGCCGG CTGCTTCGCC ACTGTCGTGG TCCGCGAGCT CGTCGATCTG
TGGCCGGCAG GCTTAATGGA CACTTCATGC GTATTCTGA
 
Protein sequence
MSESELLGPR AYGEACGQAL LKACAEDFQV DEVLDIPLSG QGEHLWLWVE KRGLNTEEAA 
RRLARAAGMP LKAISYAGLK DRQALTRQWF SLHLPGKADP DLVAAEDDSL RILERVRHSR
KLQRGAHAAN GFKLRLTRLA ADRPALDARL EQLRRQGVPN YFGLQRFGHD GGNLAEARAF
AVRRELPAQR NLRSRLLSAA RSYLFNRVLA ERVAVGDWNR AQPGDLLAFT DSRSFFPAGV
EECADPRLAL LDLHPTGPLW GAGGSPAGAA TKVLEDAVGR CEAPLGDWLG EAGMLHERRI
LRLPIDRLAW HYPAIDILQL EFVLPAGCFA TVVVRELVDL WPAGLMDTSC VF
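
The sequences above are printed in space-separated blocks of 10 characters. The following is a minimal sketch, not the method used to produce this record, of how the gene sequence could be cleaned, its GC fraction computed, and the coding sequence translated. The helper names are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares; table 11 differs only in the set of permitted start codons, which this sketch does not model.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(text):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean_sequence(text)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 21 bases of the gene sequence listed above.
print(translate("ATGAGCGAGT CCGAACTGCT C"))  # prints: MSESELL
```

Passing the full gene sequence to `gc_fraction` and `translate` should allow the GC content and protein sequence fields to be checked against the values in this record.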