Gene Ava_1143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_1143 
Symbol 
ID3683397 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp1400667 
End bp1402421 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content42% 
IMG OID637716479 
Productsurface antigen (D15) 
Protein accessionYP_321662 
Protein GI75907366 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2831] Hemolysin activation/secretion protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0908627 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00528068 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGAAAAATA TTGACTTAGG CAGGAATGAT GCCCGAATTA AAGCAAAATT TCCCTGTCCC 
AGATTATTTC TGCTACTAAC CTTAATTAGC TTTCCTGGTG TTGCGTCTGC CCAATCTACA
CCACCGGCAG GGGTGACAAT TCCCCCTACT ACACCAGAAA CTATCGACCA AACTATCCCT
AAACCCTCCC CTATTCCCAC TGTTCCTACT CCCCCTTCTC CTACCACACC TATTCTTCCT
GTGCCTCCTG TGCCAACACC ATCGGATGTT ACTTCTCCTA CTGGTGAAAG CTTTTTAGTG
ACCAAAATTG AAGTTTTGGG GTCTACAGTC TTAAAAAATG AAATCGCCAA GTTGATTAAA
CCATTCGAGA ACCGTCGAGT TACTTTTGCA GATTTAATTC AACTACGCTC AGATATTACC
GAACTTTATA TTGAAAACGG CTACATCACC AGTGGCGCAT TTTTACCCAA CAATCAAAAT
CTGACTGATG GTGTGGTAAA AATTCAGGTG GTGGAAGGAG AACTAGAGAA AATTGAGATT
ACTGGGTTAA GAAGTCTTCA ATCAGTATAC GTGCGATCGC GTCTCGCCAA AGCTACCTCC
ACGCCATTAA ATCGCCAACG CATAGAAGCA GCATTGCAAC TATTACAACT AGACCCCGTG
ATTCAACGGG TAAATGCTGA GTTAACTGCT GGCAGTACTT CTGGTAGTAG TATCTTGCTA
GTAAATATCA CCGAAGCACC AGCATTTCAT AGGGGAGTTT TTACAGCCAA TAACCAAACT
CCCAGTATTG GTTCGACTCA ACTAGGGGTA TTTTTGAATC ATGATAACTT GCTCGGTTTT
GGCGATCGCC TCGCTGCCGA ATATACAATT ACTGAAGGAC TTAACTTGTA TGATGTCAGC
TATACAATTC CGGTAAACGG CAATAATGAC ACATTGAGTT TTCGGGTAAA TAATGCCAAT
AGCCACATTA TTACAGATGA TTTCCGCGAT TTGGACATTA GAAGCGAAAC TCAAACCTAT
TCCCTCAGTT ATCGTCATCC TCTCTACCGC CAACCCCAAA CAGAACTTGC CCTGAGTTTA
GGCTTAGATT TACGTCGTTC GCAAACATTT CTCCTCGATA ATATCCCTTT TTCTTTTTCT
CCTGGTGCGG AGGATGGAGA ATCAAAAATT ACCGCCATTC GTTTTTCTCA AGATTGGGTA
AAACGAGATT CCACAAGTGT TTTAGCAGCT CGCTCCCAAT TTAGCCTTGG TATTGGCGCT
TTTGATGCTA CAGTCAACGA CACTGATACG GATGGGCGCT TCTTTTCTTG GTTAGGACAA
TTTCAATGGG TGCAGTTATT ATCTTCACGA ACATTAATCC TCACTAGAGT CAATGCCCAA
CTGACGGGAG ATGCTTTATT ATCATTAGAA AAATTTAGTA TTGGTGGGTT TGATACAGTT
CGTGGTTATA CTCAAAACAA ACTCGTAGCC GACAATGGTT TTACGGCTTC TGTGGAAGTT
CGTCTTCCCT TAACTGCTAA CTCTAATGCT TTGCAGATAG CACCTTTTTT TGATATTGGT
ACTGTGTGGA ATAATCGCGG TAGTAATCCC CAACCACAGA CAATCTCCAG TCTCGGTTTA
GGCTTGCTTT GGCAACCAAG TCGAGATTTA AACCTACGTT TAGATTATGG TATTCCCTTA
ACGAATGTTA ACTATAGCGG AAACACACTG CAAGAAAATG GTCTTCACTT TTCACTGCGT
TATCAACCAT TTTAA
 
Protein sequence
MKNIDLGRND ARIKAKFPCP RLFLLLTLIS FPGVASAQST PPAGVTIPPT TPETIDQTIP 
KPSPIPTVPT PPSPTTPILP VPPVPTPSDV TSPTGESFLV TKIEVLGSTV LKNEIAKLIK
PFENRRVTFA DLIQLRSDIT ELYIENGYIT SGAFLPNNQN LTDGVVKIQV VEGELEKIEI
TGLRSLQSVY VRSRLAKATS TPLNRQRIEA ALQLLQLDPV IQRVNAELTA GSTSGSSILL
VNITEAPAFH RGVFTANNQT PSIGSTQLGV FLNHDNLLGF GDRLAAEYTI TEGLNLYDVS
YTIPVNGNND TLSFRVNNAN SHIITDDFRD LDIRSETQTY SLSYRHPLYR QPQTELALSL
GLDLRRSQTF LLDNIPFSFS PGAEDGESKI TAIRFSQDWV KRDSTSVLAA RSQFSLGIGA
FDATVNDTDT DGRFFSWLGQ FQWVQLLSSR TLILTRVNAQ LTGDALLSLE KFSIGGFDTV
RGYTQNKLVA DNGFTASVEV RLPLTANSNA LQIAPFFDIG TVWNNRGSNP QPQTISSLGL
GLLWQPSRDL NLRLDYGIPL TNVNYSGNTL QENGLHFSLR YQPF