Gene Ava_3138 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3138 
Symbol 
ID3680772 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp3900840 
End bp3902489 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content43% 
IMG OID637718487 
Productpseudouridine synthase 
Protein accessionYP_323641 
Protein GI75909345 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0564] Pseudouridylate synthases, 23S RNA-specific 
TIGRFAM ID[TIGR00005] pseudouridine synthase, RluA family 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTTTTA CAGAGGTTCT ACACCCGTTG TCAGATTTTA TTGAATATGA TTTGACCGAT 
GATGACTTAT CAGCTAACTA TTGGTATGAA GGGTATTGTC TGCAATCTGG TGATTTATTA
AGGCTGCCTC GTACTGCTTT AGTAGAGACG ATCGCTCATA GTTTAATGCA ATACCTTGCT
AAGGATGAAC GTTATTCCCG TGAAGGTAAA ATGTATGGCA TATTGCTAGT TGAAATACCC
ACGGGTGAAA GACGAGTACT TAAAGCTTTC TCTGGGTCGC TAAACGGACA AAATGTATTT
GATGGCTGGG TTCCACCAAT TCCGGGAAGA GAACAAGTTG CTATACAAGA GGAGCATACT
TTAGCTGAGT TAGATGCTAT TAAGCAGGAA TTGATTACCC TCAAGCAACT ACCACAAAGA
CAAGAGTACG AAACTCTTTC TAGAGAGTTT GAGCAGCAGT TGCAAGCAAT GAGCGATCGC
CATCGTTATC GCAAATCTCA ACGACAACAA CAACGTCAGC TAATAGACGA AACTATTTCA
CCAGCAACCC TCACTACTAC TTTAGAACAG CTTGATGAAG AAAGTCGTCA GGATGGAATT
GAACGACGGC GACTCAAGCA AGAACGAGAC ACAGTTCTGC AACCATTACA AGAAGCGATC
GCCTCAGCAA ATATTAAAAT ACAACACCTC AAGCAAAAGC GGAAAGCCCT ATCTCGGCAA
TTACAGGTGC AAATGCACGC TGCTTACTCC CTGATGAATT TTTTAGGGCA ATCTGTATCA
TTACAGCAAT TGATGCCGAA TGGCTTACCT ACAGGAACGG GAGACTGTTG TGCGCCAAAA
CTCTTACACT ATGCAGCCAC GCATGGACTG AAACCTTTAG CAATGGCTGA ATTTTGGTGG
GGTTCATCCT GTCAAGATAA AATTCAGGGT GAGTTTTATG GCGCTTGCAT GGAACGTTGT
CAGCCGTTGA TAGGATTTTT GTTGTCGGGT TTGAAACCTG ACTCAAACTT TGACAAAGAA
CAAATTAATG TGATTTATGA GGACGAATGG CTGATTGCGG TGAACAAACC TGCGGGGTTG
TTATCAGTTC CTGGTCGTTA TTTTGATACC CAAGATAGCG TTCTTAGCCG TTTGCGCCGT
TTGTTAACTC AGGAAACAAT GCTTGCTGCT GTGCATCGCT TAGATCAAGA TACCTCTGGT
ATTCTCTTAC TGGCAAGAGA CAGGCAAACT TATCGTCAAC TTAGCCAGCA GTTTCAACAG
CGACAAGTTC ATAAGGTTTA TGAAGCCATA CTTGCCGGCG TTGTCAGCAC AGAGACTGGG
ATAATTGATT TACCATTGTG GGGAGATCCA GAGAATCGAC CTTATCAGCA AGTTGATTGG
CAACGTGGTA AACCTAGCGT GACAAACTTT CGGGCGATCG CCAGGGAAGG AGATTACACC
CGCGTAGAAT TTGTACCACT CACCGGACGC ACCCATCAAT TAAGAGTTCA TGCGTCAGAT
GTGCAAGGAT TGGGGGTGGT AATTTTGGGC GATCGCTTTT ATGGTTGCAC TGCTAAAGCA
AATCGATTAC ATTTGCACGC TAGAGAACTC TGCTTTCTGC ATCCACACTC AGGAAAAATA
ATTCACTTAC AAGTAAAGAC ACCATTTTAA
 
Protein sequence
MPFTEVLHPL SDFIEYDLTD DDLSANYWYE GYCLQSGDLL RLPRTALVET IAHSLMQYLA 
KDERYSREGK MYGILLVEIP TGERRVLKAF SGSLNGQNVF DGWVPPIPGR EQVAIQEEHT
LAELDAIKQE LITLKQLPQR QEYETLSREF EQQLQAMSDR HRYRKSQRQQ QRQLIDETIS
PATLTTTLEQ LDEESRQDGI ERRRLKQERD TVLQPLQEAI ASANIKIQHL KQKRKALSRQ
LQVQMHAAYS LMNFLGQSVS LQQLMPNGLP TGTGDCCAPK LLHYAATHGL KPLAMAEFWW
GSSCQDKIQG EFYGACMERC QPLIGFLLSG LKPDSNFDKE QINVIYEDEW LIAVNKPAGL
LSVPGRYFDT QDSVLSRLRR LLTQETMLAA VHRLDQDTSG ILLLARDRQT YRQLSQQFQQ
RQVHKVYEAI LAGVVSTETG IIDLPLWGDP ENRPYQQVDW QRGKPSVTNF RAIAREGDYT
RVEFVPLTGR THQLRVHASD VQGLGVVILG DRFYGCTAKA NRLHLHAREL CFLHPHSGKI
IHLQVKTPF