Gene Ava_4851 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4851 
Symbol 
ID3679349 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp6110564 
End bp6113596 
Gene Length3033 bp 
Protein Length1010 aa 
Translation table11 
GC content42% 
IMG OID637720208 
Producthypothetical protein 
Protein accessionYP_325343 
Protein GI75911047 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00405142 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGAAAAT TTACCTTTTA TTTAATATTG TTCAATGGAT TATTTTTAGG AATTGCAACA 
GCGAACGCCA AGTTATCACC TGTAGATATA GTTAATCAAC ATTTGCAAAA TTCTCCAATT
ATAGATATAA CAACTAAGCC TCAAAAACCC ATAACCAATC AACACCCCCA TATATCTACA
CCCTTACATC CTGCTTACAC TCTTTCCCCC TCTGCTGCTA TCAAGCAATC ACATCAGCCT
CTACTACAAA GGCCAGAAAA AATTTGGGTA ATTAACCAAA ATCAACAGGT CAAGGATCAA
CCCTTTATTT GGGTGGTGAA TGATCATAAA AAGGCAGCAC AGCAACCATT TCTACAAGTT
GGTAAATCCA CAGATAAACC GGATAAGAAA CCTGCTACAG AATCTAAGCC GGAAAAAAAG
GATGACTTAG AATCTTTTGA TGAAGTAGTA AAAGATACTG AAAAACTAGA CGGTCTATTT
ACTCTCTATC GTCATAAAGA AAAGAATAAA ATATATCTAG AAATTAAGCC AGAACAGCTA
AATAAAAATT ACTTAGCTAC CGCAACCCTA GAATCTGGTA TTGGCGAACA AGGAATTTAC
AGTGGTTTAC CATTACAAGA CTTTTTATTT TATTTCCAGA GAGTAGACAA AAAACTATCT
TTTGTGGTGC GTAATGTGAA TTTTCGCACA AGGGAAGGTG ATCCACAAGC GCGATCGCTC
GCCCGTTCGT TTAGCGATTC CGTTCTCTAC TCGGTGGAAA TCAAAAGCAT CCATCCCCAA
AGAAAAACCT TGTTAATTGA CTTGGGTGAC TTGCTGCTGG CAGATTTAGC CGGATTATCT
TTGTTTACAG GATTGACTCC AAATACAGAC CAGTCTTCCT TTGGCAGTGC TAAAACCTTT
CCCCACAACT TAGAAATTGA GTCGGTATTG AACTTCTCTA GCAGTACTGG TACAAACCCT
AACAATGAAA TGTTATATTT CACGACCGTA CCAGATAGTC GTGGCTTCAC CCTCAGGGTT
CACTATAGTC TTTCCCAACT ACCAGAAAAT AATTATCGTC CCCGGATAGC TGATGAACGG
GTTGGTTACT TTATCACTGC TTACCAAGAT TTATCTAAAG AAGAACGCAA CGATCCTTTT
GTCCGCTATA TTAATCGCTG GCACTTAGAA AAGAAAGACC CGGAATCATC CCTATCTCGT
CCCAAAAAAC CCATTGTCTT CTGGATTGAT AACGCCGTAC CCTTACAGTA CCGCGAAGCT
GTCAAAGAAG GGATACTCAT GTGGAACAAG GCCTTTCTTA AGGCGGGATT TCAAGATGCA
GTGGAAGCCA GACAAATGCC AGACAATGCC GCATGGGACC CAGCCGATAT TCGTTACAAT
ACAATTCGTT GGATTAACAC TGTTGATGGT TATTTTGCTA TGGGGCCATC TCGCGTTAAT
CCTTTAACTG GGGAAATTTT GGATGCAGAC ATATTAGTTG ATGCTAGTCT TGTCCGCTTA
CTCAAAAATC GATACAGCAC ACTTGTAGAA CCTAGTCAAC TCAATACCCG TACCTCCTTA
TCGGCATTAA TGCGGAATCG GGGACTTTGT AACAAAGGTT TAGCCGCAAA AGCCAACAAC
ACTACTCAAG AAAAATCTCC AAGACCAAAT GGTTTTTTGC AGCGTTTATC CAAGCTAGCC
GGTGATTATG ACTTATGCTA CGGCATGGAA GCCGCCAATC AATTTGCTTT TGGGGCTTTG
TCCATGTCAC TGCTACAAAA CAACGCACCG AATCAAGAAC AGCTACAAGA ATATATCAAT
CAATATTTAC GTTTAATTGT TGCCCATGAA GTAGGACATA CCCTGGGTTT ACGTCATAAC
TTCCGTGGTA GTAATCTGCT ATCACCAGAA GAGATGAACA ATCAAGAAAT TAGCCGCCAT
AAAGGTTTGA CAAGTTCGGT GATGGACTAT ATTCCACCGA ATATTGCCCC CCAAGGGACA
CCGCAGGGAG ACTATTTTCC CAGTATGGTG GGGTCTTATG ATGATTGGGC TATTCAGTAC
GGTTATACCC AAACCAACGC GAAAACTCCC ATAGCAGAAA AGCCGATTTT ACAAGCAATC
GCCAGCCAAT CTTATAAGCC GGAATTGAGT TATTCTCCCG ATGAGGATAT GTATGACCTC
GACCCCACCG CCGATGCTTG GGATCATAGT GGTAACGTGC TGGTTTATTC TCAATGGCAA
TTAGATAATT CTCGGTTGAT GTGGGCAAAT CTCAATAAAC GTTTCCCTAT GCCGGGAGAA
AGTTATAGTG ATTTAAGCGA TCGCTTTAGC TCAGTTCTCA GTAACTATTT TCAGAATATC
TTCTACACAA CAAAATACAT TGGTGGGCAG TCCTTCTACC GTCTACAGGC TGGGGAAATA
TCAGCTACTA AGCTAGCCAG TCGCCCAAAT CACTTACCCT TTGAACCTGT ACCTGTTGAA
CAACAACGGC AAGCACTCAA AACACTACAA AAGTATATTT TTGCTGAAGA TGCCCTGAAT
TTTCCCCCAG ACCTACTGAA TAAATTAGCA CCTTCTCGCT GGTATCACTG GGGGAGTTTT
CCCCAAATTG GCCGCTTAGA TTATCCCGTT CATGACTTAA TATTATTCCT GCAAGCTGCT
GTATTACGGG AATTACTGGC AGGCGATCGC CTCACTCGTC TCAAGGATAT TGAACTCAAG
AGCTTACCCG AAAAATCACT AGCATTACCT GAGCTTTTTG ATACTTTGCA AGCTGGAGTT
TGGACAGAAG TTCTCAAACC AAAAGCCGGG GCGCTGAAAA TTACTAGCCT CCGGCGCGGC
TTGCAGCGAG AATACCTCGA TATCTTGATT GGTATGGTGT TGCGGCGAGA ATACGTCCCG
GAAGATGCCC GTTCCTTGGC TTGGTATAAA CTTAAACAAT TAAACGACAA ACTCAAGTCA
GTCAATACCA ACGATGAATA TACCAAAGCC CACTTGTTAG AAACTAGCGA TCGCATTGAG
AAAGCTTTGA ATGCCCCATT GCAGGCGAAT TAG
 
Protein sequence
MRKFTFYLIL FNGLFLGIAT ANAKLSPVDI VNQHLQNSPI IDITTKPQKP ITNQHPHIST 
PLHPAYTLSP SAAIKQSHQP LLQRPEKIWV INQNQQVKDQ PFIWVVNDHK KAAQQPFLQV
GKSTDKPDKK PATESKPEKK DDLESFDEVV KDTEKLDGLF TLYRHKEKNK IYLEIKPEQL
NKNYLATATL ESGIGEQGIY SGLPLQDFLF YFQRVDKKLS FVVRNVNFRT REGDPQARSL
ARSFSDSVLY SVEIKSIHPQ RKTLLIDLGD LLLADLAGLS LFTGLTPNTD QSSFGSAKTF
PHNLEIESVL NFSSSTGTNP NNEMLYFTTV PDSRGFTLRV HYSLSQLPEN NYRPRIADER
VGYFITAYQD LSKEERNDPF VRYINRWHLE KKDPESSLSR PKKPIVFWID NAVPLQYREA
VKEGILMWNK AFLKAGFQDA VEARQMPDNA AWDPADIRYN TIRWINTVDG YFAMGPSRVN
PLTGEILDAD ILVDASLVRL LKNRYSTLVE PSQLNTRTSL SALMRNRGLC NKGLAAKANN
TTQEKSPRPN GFLQRLSKLA GDYDLCYGME AANQFAFGAL SMSLLQNNAP NQEQLQEYIN
QYLRLIVAHE VGHTLGLRHN FRGSNLLSPE EMNNQEISRH KGLTSSVMDY IPPNIAPQGT
PQGDYFPSMV GSYDDWAIQY GYTQTNAKTP IAEKPILQAI ASQSYKPELS YSPDEDMYDL
DPTADAWDHS GNVLVYSQWQ LDNSRLMWAN LNKRFPMPGE SYSDLSDRFS SVLSNYFQNI
FYTTKYIGGQ SFYRLQAGEI SATKLASRPN HLPFEPVPVE QQRQALKTLQ KYIFAEDALN
FPPDLLNKLA PSRWYHWGSF PQIGRLDYPV HDLILFLQAA VLRELLAGDR LTRLKDIELK
SLPEKSLALP ELFDTLQAGV WTEVLKPKAG ALKITSLRRG LQREYLDILI GMVLRREYVP
EDARSLAWYK LKQLNDKLKS VNTNDEYTKA HLLETSDRIE KALNAPLQAN