Gene Ava_3090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3090 
Symbol 
ID3681012 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp3840057 
End bp3841694 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content44% 
IMG OID637718435 
Producthypothetical protein 
Protein accessionYP_323594 
Protein GI75909298 
COG category[R] General function prediction only 
COG ID[COG4188] Predicted dienelactone hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000551056 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAATCA CTGGGAAAAG TATAAAACTT TTTGCTGGTC TTGTCTGCAC ATTTAGCCTC 
ACACAGTTTT TGGCTACAAA TACTCCTGTA CAAGCAGCTG AAACAGTCGT AGTGCGATTT
GGGTTATTTG CCGAATCTAT ACCTGTGGCT GACTTACAAA AAGCGGCGGA GACTGGCGAA
TTTCCCAGCA GTTTAAACCT ATTCACAAGA CGATTATCAG AACAACAACG CCGTACCCTC
ATCGGAGCGC TGAGGATGAG AGTACCGTTG AATGTTGTTA CCATCAGCAG GTTACTAAAT
ACTCAAATTG GGACAACTAT TCTCAATGAC TTATCCAGGG CTGTGGTTCG CAAAGATCAA
TCTGGTGCAA AAGCCTTAAG AGCTAGTTTG GTATTGGGTT CTACAGCACC ACAGGGTCTT
TCAATTCTCA GTTTTATTAC TGCTTACCCC AGTCGCAGCC TAGAAATTAA CCTACCCCAA
GCCTTCCAAG TCGCGGGGAG TTTAAACAAT GCTTTCTGGC GTACACAACA ATTTATGCTG
GCGATTAGTC CTCAACTTGA TCCCGCCAAA CCTCAGATTT CCATACCTTT TGACCCCAGC
CAACCAGGAA ACGCTCAAGT GCAAGTACTG AAATTGAACT TGAATGATCA AAAACGCAAC
CGCCAAATTC CGGCGGATAT ATACTGGTCA ACTTCTGCAA CTCAGGAAAA ACCCGTAATT
ATCTACTCCC ACGGGATGGG ATCAGTCCGC ACAGATTTAC ACTATCTAGC CGAACATCTA
GCCTCCCACG GTTATATATT TGTCGCTTTA GAACATCCGG GAAGTAATCA GGCCAATACA
GATTTAGCAA CCAAAGGTAA AGTGCGACTT TTAGAACCCC AAGAGTTTTT AAATCGTCCT
CAAGATGTCA GTTTTGTTTT GGATGTATTA GAAAAACTCA ACCAAACAAC AGGTAATCCC
TTACAAGGGA AATTGGCAAC TAATAACACG ATGGTGATTG GCTATTCTTT TGGCGGTGGT
ACAGCTTTAT CCTTGGCTGG AGCCGAGTTA CAAATAGCAG GAATCAGAGA ACGCTGTCAG
AATAAATTAA CTATTTTAAG CCTGGGAGAA ACTATCCAAT GTGTTGCTCA AGAATTGCCA
GAAAAAACTT ATCAACTGCG GGATAACAGA ATTAAACAAG CAATAGCTTT AACTCCCACA
ACTTCATTAA TGTTTGGCGA AACTGGTTTA ACAAAGGTGC AAATCCCGAC TTTAATTGTC
GCGGCTTCCG CAGATAAAAC CACCCCTGCT TTAACCGAAC AAATTTTGGG ATTTAGCAAA
ATTCCATCGC CGAAATGGTT GGTGGGTATC ATTGGTGGTA CACATTTGAG TGTGAAAGAC
CCCAGCACCA CATTAGATCA GGTGGACAAA CCCAACACAC CCCTGACTGG TGGTGAAATA
GTCGGAGAGC AAGCTACCGA TGTTCGCCAA TTTGTCAAGG CGATCGCTCT AGCAATGGTG
GCACAACTCA CGCCAGAAGC CGAAAAATAT GCGGTCTTCC TCACCCCAGA TTACGCTCAG
TTAGCTTCAA CCGAGTCATT TCCTTTTCGC ATAGTTACAG AAATCCCTCC ACAAGCTATT
CCTATAGGTA AACAATAG
 
Protein sequence
MGITGKSIKL FAGLVCTFSL TQFLATNTPV QAAETVVVRF GLFAESIPVA DLQKAAETGE 
FPSSLNLFTR RLSEQQRRTL IGALRMRVPL NVVTISRLLN TQIGTTILND LSRAVVRKDQ
SGAKALRASL VLGSTAPQGL SILSFITAYP SRSLEINLPQ AFQVAGSLNN AFWRTQQFML
AISPQLDPAK PQISIPFDPS QPGNAQVQVL KLNLNDQKRN RQIPADIYWS TSATQEKPVI
IYSHGMGSVR TDLHYLAEHL ASHGYIFVAL EHPGSNQANT DLATKGKVRL LEPQEFLNRP
QDVSFVLDVL EKLNQTTGNP LQGKLATNNT MVIGYSFGGG TALSLAGAEL QIAGIRERCQ
NKLTILSLGE TIQCVAQELP EKTYQLRDNR IKQAIALTPT TSLMFGETGL TKVQIPTLIV
AASADKTTPA LTEQILGFSK IPSPKWLVGI IGGTHLSVKD PSTTLDQVDK PNTPLTGGEI
VGEQATDVRQ FVKAIALAMV AQLTPEAEKY AVFLTPDYAQ LASTESFPFR IVTEIPPQAI
PIGKQ