Gene Ava_4226 details


Gene Information

Locus tag: Ava_4226
Symbol:
ID: 3680932
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 5298424
End bp: 5302788
Gene Length: 4365 bp
Protein Length: 1454 aa
Translation table: 11
GC content: 42%
IMG OID: 637719574
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_324720
Protein GI: 75910424
COG category: [S] Function unknown
COG ID: [COG1572] Uncharacterized conserved protein
TIGRFAM ID:
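
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the start and end coordinates are inclusive and that the gene length counts one terminal stop codon that is not translated. The variable names are illustrative and not part of the record.

```python
# Values copied from the gene record.
start_bp = 5298424
end_bp = 5302788
gene_length_bp = 4365
protein_length_aa = 1454

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that yields no residue.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions, all three checks agree with the listed values.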


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.542738
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGTTA CCCGTGAATA TAACAACTTA AATAGGTTGG ATAATTTAAG CGATGCCTTA 
TCATCTGCTA GCAACTCTAG GATTTTTAAT AGTAATAAAG ATTACTTCCC CCTCGATACT
GCTGTAAGCT CAATCATCGG TAGTAGTTAT ATAGCTGAAG CGACAGGAGT TTTAGCATCT
AATAGCTACG CAAGTAATGA TTTAAGCAGG AGTAATTTAG GTAACACTAA TTTTACACAA
GCATCACTAC TACCAGACCT AACAGCACCA ATCGGCTCAA TTCCCACTTC TGCTAATGTC
GGGAGCAGCA TTCAAATCAA CTATCAAGTC AGAAACCAAG GTAGTGCAAG TGCTGGAGGC
AGCTTCACCA ACTTCTATTT ATCTCCAGAT TTTAATCTCG ATAGTAGTGA TAGGTATCTA
GGCTTTGACT ATGTGAGTGG TTTGGCGGTT GGTGCTTCTA GCCAAAAGTC AGCCACACTC
ACCATTGGTA GTAATATCAA CCCTGGTAAT TACTATTTGA TTTACTATGT GGATGGTGAT
GGCTATGTGA GTGAAAGTAA TGAAAATAAC AATATCTTCG GTGCAGCAAT CTCGATTACC
CAGCCAGACT TAACAGTATT AAACGCTTCA ATTCCCACCT CAGCAAGAGT CGGGAGCAGT
ATTCAAATAA ACTATCAAGT CAGAAATCAG GGTAATGGAA GTGCTGGTGA TAGCAATACC
AAATTCTACT TATCTCCAGA TTTGAATATT GATAGTAGTG ATTTCTATCT AGGTTTGGAC
TATGTGAGTG GTCTGGCTGC TGGTGCTTCT AGACAAGAGT CAGCCACATT CACCATTGGT
AGTAACATCA ACCCTGGTAA CTACTATTTG ATTTACTATG CTGATGCTGA CGGCTATGTG
AGTGAAAGCA ATGAAAATAA TAACGCCTTT GGCACTCTAA TTAATATTAC TTCGGCAGGG
AATCCAGACC TAATTATTCA AAACCCCACA GCACCAACGA CAGCATCAGT AGGTAATACC
ATCCAACTGA GTTATCAGGT AAGAAACCAA GGTCTGGGGA ATGCGGTTGC TAGTACCACC
AGATTTTACC TTTCTAGAGA CACAACATTT AGTACAGATG ATGTGTTGTT AGGTTCCGAT
TCTGTCGCTA GCATCGCCGC AGGTGCTGTC AGTTCCGAAA CAGCCTCCAT TGTTATAGCC
AATAGCATCG CGGCTGGTAA CTATCATTTG TTGTTCAGAA CTGATGCAGA TAATAACTTG
GCGGAAAGTA ATGAAACTAA TAATTTGGTT TCTAGAACCA TCACCATTAA CACAGCTGAT
TTAATCGTTC AAAACCCCAC AGCACCAACG ACAGCATCAG TAGGTAATAC CATCCAACTG
AGTTATCAAG TAAGAAACCA AGGTGCTGGA AATGCCGTTG CGAGTACCAC CAGATTCTAC
CTTTCTAGAG ACACAACATT CAGCACAGAT GATGTGTTGT TAGGTTCAGA TTCTGTCGCC
AGCATCGCCG CAGGTGCTGT CAGTTCCGAA ACAGCCTCCA TTGTTATAGC TAATAGCATC
GCTGCTGGTA ACTATCACTT GTTATTCAGA ACTGATGCAG ATAATAACTT GGCGGAAAGT
AATGAAACTA ATAATGTGGT TTCTAGAACC ATCACCATTA ACACAGCTGA TTTAATCGTT
CAAAACGCCA CAGCACCAAC GACAGCATCA GTAGGTAGTG CGATCGCATT GAGTTATCAG
GTAAGAAACC AAGGTGCTGG AAATGCCGTT GCGAGTACCA CCAGATTCTA CCTTTCTAGA
GACACAACAT TCAGCACAGA TGATGTGTTG TTAGGTTCAG ATTCTGTCGC TAGCATCGCC
GCAGGTGCAG TCAGTTCCGA AACAGCCTCC ATTGTTATAG CCAATAGCAT CGCTGGAGGT
AACTATCATT TGTTGTTCAG AACTGATGCA GATAGCACAG TAGCGGAAAG TAATGAAACT
AATAACATCG TTTCTAGAGC CATCACGATT AATGGCCCAA GACCCGACCT GATTATCCAA
AATATTTCTG CTCCTAGCAT TGTTGACCCT GGTAACATAT TTACACTCAA CTACCAAGTA
GCAAATCAAG GTACTGCTAG TGCTGGTAAT CATAGAACTA AGATTTATTT GTCTAGAGAC
ACAACTCTTA GTAGTGATGA TATATTGTTA GCCTCTGACC CTAACTACTT CTATCCTGTA
CTAAATGCTG GCACTTATAG TTCAGAATCT TACCTTTTGT CTATAAGTAG GGATATCAAT
TTTGGTAACT ACCATCTGCT ATTGCAAGCT GATGGCAATG ACGAAATTAG TGAAAGCAAC
GAAAGCAACA ATGTCACCGC CAAAGCGATT ACAATTGCTG CACCAGATTT AATTGTCCAG
AATCCTTCGG CTCCTGCCAG CGCTAACATT GGTACAACAA TTTCACTCAG CTATCAATTA
AAAAACCAAG GTAATGGCAA TGCAGGTTTT CATTTCACTA ATTTTTATCT TTCCCAAGAC
CAAACGCTTA GTAATGATGA TGTCTATTTA GGCTTTGATG CCATCTCTAG TCTTGCACCC
TCTGTTGTAG CTTCACGCTC TACATCTTTA ACAATTAGAA GCAATACTGT TCCTGGTAAT
TACTATCTGC TTTATAAAGC TGATGGTGAT GGAACGATAA GAGAAAGCAA TGAAAATAAT
AACGTCGCTG CGAGAGCAAT CACGATTACT GCACCAGATT TAGTGATTGA AAATGCTACC
TCTGCTGGTA GTGCTGCCAT CGGGGCTACA CTGCAAGTTA ACTATCAACT GAAAAACCAA
GGTAATGGCA CTGCTGGAGG CAGTAAGACA AGTTTTTATC TTTCACGAGA TGGGGCATTT
GGTGATGATG ATATCTACTT AGGTATGGAA ACTCAAGCTA GTGCTAGTGT GACACCTGGT
GCCTCTATTT CTCGCTCCAC GGCTATTACG CTTGATCCTA CGATCAATCC CGGCCAATAC
TACCTCATAT TTAGAGCGGA TGGAGCAGGA TCTGTTGCTG AGAGCAATGA AGGTAACAAC
GGCCTTTACA TCACAGCACC AATCAATATC ACTCCTATTA ATGGCGGTGG ATTTAACTCC
ACTACAGGTT ATGGTTTAGT CAACGCCGCC GCCGCCGTGG CCAAAGCCCT TAATCAAAGC
ACCTTTGCTG ATGTAGCTGA CTTGGGTGGT AATGATTGGG GAGCAGATGC GATTAAAGCA
CCTGAAGTCT GGGCAAGGGG ATACACTGGT CAAGGTGTGA TCGTTGCGGT GGTGGATAGT
GGAGTGGACT ATACACACCC AGACTTGAGT GCCAATATGT GGAGGAATAG CAGAGAAACT
GCGGGTAATG GCATAGATGA TGATGGTAAT GGTTTTATTG ATGATGTTTA CGGCTGGAAC
TTTTTCGGCA ACAACAACAA TCCACTAGAT GATAACGGTC ATGGTACTCA TGTAGCTGGT
ACTATTGCTG CGGTGAGAAA TACCTTTGGT GTCACTGGTA TTGCCTATAA TGCCAAGATT
ATGGCATTGA AAGCGTTGGG TGGTCCTCAA GGAACAGGTT CAGATGATAT GGTTGCCAAC
AGTATTCGTT ATGCTGCGAA TAATGGGGCG CGAGTGATTA ATCTCAGTTT AGGAGGATCT
AATCCTGCAC CTGATATTCT CTCAGCTATT CAATATGCCA TCAGTAAAGG AGCAATTGTT
GTTTCCGCAT CTGGGAATGA AGGTCAATCT CTACCAGGCT ACCCAGCTCG CTATGCAGAC
CAGTTTGGAA TTGCTGTGGG AGCAGTCAAT TACAATAGAA CCTTAACCGA TTTTTCCAAC
CGTGCTGGAA CAACTCCACT GGCATACGTT ACAGCCCCTG GTGCATACGA TGACTTTTTT
GGCATTGGTA TATATTCGAC CATACCAGGG GGTGGCTATG GTTTGAAGCC GGGAACATCA
ATGGCTGCTC CTCACGTTGC GGGTGTGGTG GCGCTGATGC TGAGTGCGAG GAATAATCTC
ACTGATGCTC AAGTGCGTCA GATTCTCACT TCTACAGCCG CGAATGGTGG TACACTCCCC
AGTGCTAATT TGAGTACTTT GTCAAACACA GGTAGTAGTA ATACGACGCT TTCTGGATTA
ACTTCTAGCT ATACAAGTGC TAGTCTAGTT GAGATTTCTA ACTGGACTAC AATTGCTCGT
GAACAAAGCA ATACTATAGA TTTGGCAAAT TTAAGAAGAA CATTTGCCTA CTATCAAGAC
AGTCGTGATT ATCAAGGTGT TCTCTCTTCT CAAGAAGTAG ATATTGATGA GGAAAGTATA
AACAAGAGAC GCAGAAATAC ACCTAAGACC GTCAGGATTA GTTAA
 
Protein sequence
MSVTREYNNL NRLDNLSDAL SSASNSRIFN SNKDYFPLDT AVSSIIGSSY IAEATGVLAS 
NSYASNDLSR SNLGNTNFTQ ASLLPDLTAP IGSIPTSANV GSSIQINYQV RNQGSASAGG
SFTNFYLSPD FNLDSSDRYL GFDYVSGLAV GASSQKSATL TIGSNINPGN YYLIYYVDGD
GYVSESNENN NIFGAAISIT QPDLTVLNAS IPTSARVGSS IQINYQVRNQ GNGSAGDSNT
KFYLSPDLNI DSSDFYLGLD YVSGLAAGAS RQESATFTIG SNINPGNYYL IYYADADGYV
SESNENNNAF GTLINITSAG NPDLIIQNPT APTTASVGNT IQLSYQVRNQ GLGNAVASTT
RFYLSRDTTF STDDVLLGSD SVASIAAGAV SSETASIVIA NSIAAGNYHL LFRTDADNNL
AESNETNNLV SRTITINTAD LIVQNPTAPT TASVGNTIQL SYQVRNQGAG NAVASTTRFY
LSRDTTFSTD DVLLGSDSVA SIAAGAVSSE TASIVIANSI AAGNYHLLFR TDADNNLAES
NETNNVVSRT ITINTADLIV QNATAPTTAS VGSAIALSYQ VRNQGAGNAV ASTTRFYLSR
DTTFSTDDVL LGSDSVASIA AGAVSSETAS IVIANSIAGG NYHLLFRTDA DSTVAESNET
NNIVSRAITI NGPRPDLIIQ NISAPSIVDP GNIFTLNYQV ANQGTASAGN HRTKIYLSRD
TTLSSDDILL ASDPNYFYPV LNAGTYSSES YLLSISRDIN FGNYHLLLQA DGNDEISESN
ESNNVTAKAI TIAAPDLIVQ NPSAPASANI GTTISLSYQL KNQGNGNAGF HFTNFYLSQD
QTLSNDDVYL GFDAISSLAP SVVASRSTSL TIRSNTVPGN YYLLYKADGD GTIRESNENN
NVAARAITIT APDLVIENAT SAGSAAIGAT LQVNYQLKNQ GNGTAGGSKT SFYLSRDGAF
GDDDIYLGME TQASASVTPG ASISRSTAIT LDPTINPGQY YLIFRADGAG SVAESNEGNN
GLYITAPINI TPINGGGFNS TTGYGLVNAA AAVAKALNQS TFADVADLGG NDWGADAIKA
PEVWARGYTG QGVIVAVVDS GVDYTHPDLS ANMWRNSRET AGNGIDDDGN GFIDDVYGWN
FFGNNNNPLD DNGHGTHVAG TIAAVRNTFG VTGIAYNAKI MALKALGGPQ GTGSDDMVAN
SIRYAANNGA RVINLSLGGS NPAPDILSAI QYAISKGAIV VSASGNEGQS LPGYPARYAD
QFGIAVGAVN YNRTLTDFSN RAGTTPLAYV TAPGAYDDFF GIGIYSTIPG GGYGLKPGTS
MAAPHVAGVV ALMLSARNNL TDAQVRQILT STAANGGTLP SANLSTLSNT GSSNTTLSGL
TSSYTSASLV EISNWTTIAR EQSNTIDLAN LRRTFAYYQD SRDYQGVLSS QEVDIDEESI
NKRRRNTPKT VRIS
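
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation, assuming the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code (the two tables differ in which start codons are permitted). The function name `translate` and the constant names are illustrative, and only the first and last blocks of the gene sequence are used as input.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace (the 10-base block spacing used in the record) is ignored.
    Codons not found in the table are reported as 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence (both in frame).
first_block = "ATGAGTGTTA CCCGTGAATA TAACAACTTA"
last_block = "GTCAGGATTA GTTAA"

print(translate(first_block))  # MSVTREYNNL
print(translate(last_block))   # VRIS
```

The two outputs match the first ten and last four residues of the listed protein sequence. Passing the full gene sequence to the same function would be the natural way to compare the whole translation against the record.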