Gene Ava_2169 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_2169 
Symbol 
ID3679882 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp2683612 
End bp2684748 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content49% 
IMG OID637717512 
Productdihydrouridine synthase, DuS 
Protein accessionYP_322684 
Protein GI75908388 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.320847 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTATGC CCCAGGTATC GCTGCCCCAA TCGCTTCAGA AAGACCAACC CTTCACCGCC 
CTTGCACCCA TGCAGGATGT GACAAACCTC TGGTTTATGA AGGTCATCGC CCACTACGGT
AGTCCTGACT ACTTTTTTAC TGAGTATTTT CGCGTAAATG ATACTTCGCG GCTCAATCGC
GGTATTCTGG CAGCCATCAC CGAAAATGAT ACAGGTCGCC CTGTTTTTGC CCAAATGATT
GGGGAAAGTA TTCCCGACTT GGTGAGAACC GCCAAGGAAC TCTGCCACTA TGATATCGCT
GGAGTTGATT TGAACATGGG CTGTCCCGCA CCGAGAATCT ATCGTAAAAA TGTCGGGGGT
GGATTGCTAC TCTCACCTGG GAAAGTGGAG CAGATTTTGG CAGAACTACG CCAGGCAGTG
AGCGATCGCC CCTTAACAGT CAAGATGCGG GTAGGCTTTG AAAATACAGA TACCTTTTAC
GAAATTCTCG ATATCGTCAA TCGCCACAAC ATTGATTTGC TGAGTCTGCA TGGTCGCACA
GTCAAAGATA TGTACCACGG GGCAGTGAAA TATGATTTGA TTGCGGAAGC TGTGAAACGA
GTGGATTGTC CAGTACTGGC TAATGGCAAT ATTCACTCCG CCACAACTGC CTTAGAAGTG
CTGGAGCGAA CAGGTGCGGC GGGTGTGATG GTGGGACGCT GGGCTATTGG GAATCCGTGG
CTATTTAATC AGATTCGCCA GGCTTTACGA GGAGAACCAA TCACCCCTGT TCCTTTAGTA
GAAGTACGCA ACTATATTGA TCGTTTATGG CAAACCCCGA CAGCCGCAAC TATGCCAGAG
CGATCGCGCG TAGGCTACTT AAAAATGTTC CTCAATTATA TTGCCCTAAG TGTCGATGCT
GAAGGTCAGT TCCTGCGACT GATGCGACGG ACACAGACCG AAGTGGAAAT GTTTAACCTC
TGCGATCGCT TTTTACTTAG TGATCTGACG CAAACTTTAG CCTTAGCACC TTACTCAGGC
GTAGGGGAGC AGGGGAGCAG GGAGCAGGGA GCAGGGGAGA AAGTATTTTC CCTTCCCTGT
TCTTCTACCT CTTCGGTCAA TAGTCCACAG TCCAAAAACC TTGATTTTGA CCATTGA
 
Protein sequence
MSMPQVSLPQ SLQKDQPFTA LAPMQDVTNL WFMKVIAHYG SPDYFFTEYF RVNDTSRLNR 
GILAAITEND TGRPVFAQMI GESIPDLVRT AKELCHYDIA GVDLNMGCPA PRIYRKNVGG
GLLLSPGKVE QILAELRQAV SDRPLTVKMR VGFENTDTFY EILDIVNRHN IDLLSLHGRT
VKDMYHGAVK YDLIAEAVKR VDCPVLANGN IHSATTALEV LERTGAAGVM VGRWAIGNPW
LFNQIRQALR GEPITPVPLV EVRNYIDRLW QTPTAATMPE RSRVGYLKMF LNYIALSVDA
EGQFLRLMRR TQTEVEMFNL CDRFLLSDLT QTLALAPYSG VGEQGSREQG AGEKVFSLPC
SSTSSVNSPQ SKNLDFDH