Gene Ava_0645 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_0645 
Symbol 
ID3678674 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp813122 
End bp814174 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content46% 
IMG OID637715973 
Productdihydrouridine synthase TIM-barrel protein nifR3 
Protein accessionYP_321164 
Protein GI75906868 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.446085 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.634764 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTACTC TGTCCCCCAA CCTCAAAGCT AGACTTTCCC AACCCCTAAA CATCGGCTCG 
TTTGTAGTTA AAAGCCGTGT TCTTCAGTCG CCTTTGTCGG GGGTGACAGA TATGGTGTTT
CGCCGTCTAG TACGTCGCTA TGCACCCGAT TCGATGATGT ATACAGAAAT GGTGAATGCT
ACGGGTTTAC ACTACGTCCA GCAGTTACCA AAAATTATGG AAGTAGACCC CAACGAGCGA
CCAATCAGTA TTCAGTTGTT TGACTGTCGT CCCGATTTTT TGGCAGAAGC AGCAATCAAA
GCCGTTGCGG AAGGCGCTGA TACGATTGAT ATCAATATGG GGTGTCCGGT AAATAAAATT
ACCAAAAACG GCGGAGGTTC TTCTTTACTA CGACAGCCGG AAGTTGCAGA AGCCATTGTA
CGGGAAGTAG TAAAAGCTGT TAATGTGCCG GTCACTGTCA AAACCCGGAT TGGCTGGAAT
GACAGAGAAA TTACCATTCT CGATTTTGCC AAGCGCATGG AAGACGCTGG AGCGCAAATG
ATTACGGTGC ATGGACGTAC CCGCGCTCAA GGTTACAATG GCAATGCCCG TTGGGAATGG
ATAGCCCGTG TCAAAGAAAT ACTTTCCATC CCCGTGATTG GTAATGGCGA TATATTTTCC
GTAGAATCGG CGGTGAAATG TTTAGAAGAA ACGGGTGCTG ATGGTGTGAT GTGTTCCCGT
GGGACTTTAG GTTATCCGTT TTTGGTGGGG GAAATTGACC ATTTCTTAAA GACTGGTGAA
CTCCTGACAG CACCAACCCC AATTCAACGT TTGGAATGTG CTAGAGATCA CTTACAAGCC
TTATGGGAAT ATAAAGGCGA TCGCGGTGTC CGTCAAGCCC GCAAGCACAT GACTTGGTAT
GCTAAAGGTT TTGTCGGTGC GGCTGAGTTG CGTGGACAAT TAAGCGTAAT TGAAACAGTC
CAACAAGGTT TAGATTTGAT TGACAAAGCC ACTGAAAAGC TAACTCATGG TTATGAGCTA
GTGGAGGAAG CTGATAATTT TCAGGTAGCT TAA
 
Protein sequence
MVTLSPNLKA RLSQPLNIGS FVVKSRVLQS PLSGVTDMVF RRLVRRYAPD SMMYTEMVNA 
TGLHYVQQLP KIMEVDPNER PISIQLFDCR PDFLAEAAIK AVAEGADTID INMGCPVNKI
TKNGGGSSLL RQPEVAEAIV REVVKAVNVP VTVKTRIGWN DREITILDFA KRMEDAGAQM
ITVHGRTRAQ GYNGNARWEW IARVKEILSI PVIGNGDIFS VESAVKCLEE TGADGVMCSR
GTLGYPFLVG EIDHFLKTGE LLTAPTPIQR LECARDHLQA LWEYKGDRGV RQARKHMTWY
AKGFVGAAEL RGQLSVIETV QQGLDLIDKA TEKLTHGYEL VEEADNFQVA