Gene Ava_3002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3002 
Symbol 
ID3681237 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp3721131 
End bp3723788 
Gene Length2658 bp 
Protein Length885 aa 
Translation table11 
GC content45% 
IMG OID637718348 
Producthypothetical protein 
Protein accessionYP_323507 
Protein GI75909211 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1452] Organic solvent tolerance protein OstA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.598008 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTCATC CAGTTCTGCC GCCTGCACCT CCTTCCCTTG TTGAACCTAT ACAACCTGCA 
AATTCCACCC CATCTGCTAG TGATAGTCGA TTTCATTTAG TGAGTGAGAT TGATACCACT
AAGCAAAACA AACTAGAGAC CGCCAAACCC CCAGAGAAAC AGTTTCAAAA TCCAACTGTA
GAGAAAACAG AAGATTCTTT GCAGTTACCA AGACCCACCA GCAATAAAAC AGAGCATTTA
ATTGCAACTC CTGTTCTGGC AAAACCCAGT ACACCAGAAA TCTTCACTTC AGAATTTTCC
CCCCTTACCC CTTCTAGTAG CGCTGTTAAT TTAGGACAAA CCTTGACAAT TGGTTATTCT
GTCCAAAAAA ATACAAGCAC TAATTCCCCA GTCTTCCCTA CAGCGACAAT TCCTCAGTCT
GTCCAAAAAA ATACAAGCAC TAATTCCCCA GTCTTCCCTA CAGCGACAAT TCCTCAATCT
GTCCAAAAAA ATACAAGCAC TAATTCCCCA GTCTTCCCTA CAGCGACAAT TCCTCCAGAT
GTATCACCTC AAACAAAACT AGTCGCCTCG GCATTGACTA AGCAAGAACA GAAAGACGAG
GTTACTGTAC AACAGACTGG GGACAAAATT AATATCTCTG TATCTGATAG TGACCCGGTA
GTTAATAATC AACCAATTCA AAATGTTATT GAGTTTAAGT CGCGTAACTC GTTAAATAAA
CTATCAATAC AAACTGTCAC GCCAGCTCTT AGTTCACAAG CACAGTCACC AATACCAACT
CGAACACCTG CTACACCCAA CAATACACTA CCAGCCCGAC AAAGAATTGT AGAAGTGACT
TCAGATCGGC AGGAGTATGA TGAGCAAAGA CGCGTGATTA CTGCCGAAGG TAATGTATTA
GTACGGTTTG ATGGGGCGCT ACTGGATGCC GATCGCGTGC AGGTAAATTT AGATAACTTG
ATTGCGGCGG GAGAGGGAGA TGTAGTTTTA ACTAGAGGAG ATCAGTTATT ACGAGGACAA
CGCTTTACTT ATAATTTTGT CCAAGACAGT GGGGAATTAG AAAACGGAAG CGGTGAAATC
AACGTACCCT CAGCATCAAG AGATTTTGCC TTCTTACCCA CTGATGTGAC AGCTGGTGGC
GTACCACAAC AACCACCAAG CAACTCTATC CGCACCAATC AACCTCTTTC TAATGTCAGC
AGTCCCGGTG GCATTGAATT TACATTTGGG GGAGGGAGCG AAGCGAGTAA TCTGCCTCCT
CCCAAAGCTG GGGGTGAGGT GAAGCGTATT AGGTTTGAAG CCGAGCAAAT TGAGTTTTAT
CCCCGTGGTT GGCAAGCGCG AAATGTGCGC CTGACAAATG ACCCATTTTC ACCACCAGAA
TTGGAGTTAC GGGCAGATAC AGTCACCGTC ACACGGGAGG CTCCTTTAAT TGATCGGATC
ACCACACAGC GACAGCGTTT AGTCTTTGAC CAAGGCTTGA CCTTACCAAT CCCCATAAAT
CAGCAGAAAA TCGACCGTCG AGAACGGGAT AGCACACCCT TTATCGTCTC TCCTGGCTAT
GATGGCGATA AGCGGGGTGG TGTTTTTATT GAGCGCGGTT TTACACCTAT TAGTACGGAT
AGTACTAATT TGCAGATCAC TCCCCAGTTT TTCGTCCAGA AAGCCATACA AGGTAATGAT
GATGTGACGG GGTTATTTGG TGTCAAAGCC AGATTAAATT CTGTTTTAGG TACCCGATCT
GTACTTGAAG GTATCGGAGA ACTAACTAGT TTTAATTTTG ACGACATAGA CGAGAATTTA
AAGGCGAGTT TACGTTTACG CCAAGCCTTA GGCGATCGCA ATCCTCATGT GGTGAATTGG
GAATACAGTT ACCGCGATCG CCTCTATAAC GGTACACTTG GCTTTCAAAC CGTCCAAAGT
AGTTTGGGTG GTGTCATTAC TTCTCCAGTT ATTCCTCTGG GGAACTCTGG GATCAACCTC
AGCTATCAAG CCGGCGCGCA GTATATCAAC GCCAACACTG ACCGCCAAGA CTTGCTAGAT
CCCATCAGAA CCAATGATCG AGTCTCTCTT GGTCGCCTAC AAGGTAGCGC CGCCTTGAGT
AAAGGGTTTT TACTATGGCA AGGAAAACCA CTACCACCCA CAGCCACGGA AGGTTTAAGA
TATACAGCTA CTCCTGTAGT TCCTTACTTA CAGGCGATCG CCGGTATTAC TGGTACTTCT
AGCTACTACA CCAATGGCGA CAACCAAAGC AGCCTGACTG GCACAGTTGG CTTAGTAGGG
CAGCTTGGTC ATTTTTCTCG CCCCTTCCTT GACTACACAG CTTTTAACGT CCGTTATTCC
CAAGGTTTAA CCAGTGGACT ATCACCCTTT TTGTTTGACC GCTACGTTGA TGAAAGAGTA
CTCAGTGCCG GCATCTCCCA ACAAATTTAC GGCCCTTGGC GTTTAGGTTT TCAAACATCA
ATTAACTTAG ATACTGGTAA AGAAAGTAGC ACAGACTACA TTGTTGAATA TAGCCGACGC
ACTTACGGAA TCACCTTGCG CTACAATCCG GTACTGGAAT TAGGCGCTTT TAGCATCCGC
ATTAGTGACT TTAACTGGAC TGGCGGTGCA GATCCATTTT CGGAGGTTAG ACCAGTTGTC
GGTGGTGTGA GTCAGTAG
 
Protein sequence
MLHPVLPPAP PSLVEPIQPA NSTPSASDSR FHLVSEIDTT KQNKLETAKP PEKQFQNPTV 
EKTEDSLQLP RPTSNKTEHL IATPVLAKPS TPEIFTSEFS PLTPSSSAVN LGQTLTIGYS
VQKNTSTNSP VFPTATIPQS VQKNTSTNSP VFPTATIPQS VQKNTSTNSP VFPTATIPPD
VSPQTKLVAS ALTKQEQKDE VTVQQTGDKI NISVSDSDPV VNNQPIQNVI EFKSRNSLNK
LSIQTVTPAL SSQAQSPIPT RTPATPNNTL PARQRIVEVT SDRQEYDEQR RVITAEGNVL
VRFDGALLDA DRVQVNLDNL IAAGEGDVVL TRGDQLLRGQ RFTYNFVQDS GELENGSGEI
NVPSASRDFA FLPTDVTAGG VPQQPPSNSI RTNQPLSNVS SPGGIEFTFG GGSEASNLPP
PKAGGEVKRI RFEAEQIEFY PRGWQARNVR LTNDPFSPPE LELRADTVTV TREAPLIDRI
TTQRQRLVFD QGLTLPIPIN QQKIDRRERD STPFIVSPGY DGDKRGGVFI ERGFTPISTD
STNLQITPQF FVQKAIQGND DVTGLFGVKA RLNSVLGTRS VLEGIGELTS FNFDDIDENL
KASLRLRQAL GDRNPHVVNW EYSYRDRLYN GTLGFQTVQS SLGGVITSPV IPLGNSGINL
SYQAGAQYIN ANTDRQDLLD PIRTNDRVSL GRLQGSAALS KGFLLWQGKP LPPTATEGLR
YTATPVVPYL QAIAGITGTS SYYTNGDNQS SLTGTVGLVG QLGHFSRPFL DYTAFNVRYS
QGLTSGLSPF LFDRYVDERV LSAGISQQIY GPWRLGFQTS INLDTGKESS TDYIVEYSRR
TYGITLRYNP VLELGAFSIR ISDFNWTGGA DPFSEVRPVV GGVSQ