Gene Ava_3164 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3164 
Symbol 
ID3680720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp3927057 
End bp3929999 
Gene Length2943 bp 
Protein Length980 aa 
Translation table11 
GC content40% 
IMG OID637718513 
ProductCna B-type 
Protein accessionYP_323667 
Protein GI75909371 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3188] P pilus assembly protein, porin PapC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.592408 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0320112 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTTACG ATCGCCCCCC AATTTCCCCC CCACCAATCA CACCACATTC GTTAGTAAGT 
TCTATTCCCA CACCCCAAGA TAATGATCAT AGCCAAACGG CTGATACATT TAAGTATGCG
ATCGCTACTC CTAGTGAACT AGAAACATTT CCCAATGAGT TCTCTACAGG TGTTTCACCG
CAGCTTTTTA CAACAGAACA TTTTAGTGAG GAACTCATTA CTCAACATAT AGTTAGCGAA
CTACCTGTTG GTAATACGAG TAACAATAAC GTCACCAATA TCCCAGGAAT ATCTACTGTT
GACAATATTA AACAAGCTCC TAAAGCTGAA TCTTCTCCAC CAAATTTGGG TAAACCGGAA
AAAGATAATG GTGTCACAGC ACCAGTAAAT ACAACAGATA CAATTACTTT TACCGAAACT
TCTAGTTCTG GTATAGAAAC TTTATTACTA GGGGTAATTA TTAACCGTAG AGAAGTTGGT
AGTTTAGATG TTATACGTCG AGGAAATACC TTACTAGTAC CTTTAGAAAA CTTTGCTCAA
CTTACAGGCT TAACCATTAC AACAACTGAT GATACTACAG AAATCAAAAC ACCTCTAGGT
GCTGTGAAAC TAGCGTCTAG TGAACTAGAG ATAATTCAGG GAATTACCCA TATTAGCGAT
TCTTTATTAC AAGAAAAACT CAATACCACA ATCGAGTTGA GGTCTTTAGA AGCAGCTTTG
ATTGTTGGGT TACCTTGGTT ATCAGGTAGT AGAGAAGCGC GAAATGCAGC AATTGATCTA
GAACCAGAAG TTAAAGCACC GCTAAGTGGA TTGTCTAGTT TTAGACAGGA ACTTAATGTT
GTTAATGATT CTGGTGGTAC TCGGTTCCAG AGTTATAGTT TGTTGGGTGG GAGACTCGCT
GGCGGGACTT GGCGTATACG TTTAAATAAT AATTTTGAAA ATTCTCCCAA CTTAGCTGAA
TATTTTTTCT ATAAACGTAA TGGTAGATTT CTTTATCAAT TAGGTCGTCA ACAAATAAGT
GTACATCCCC TATTAAATGG GCAAATTTTA ACGGGCGCAC AATTTGGTTA CACTAATTTA
CCCGCAGATC GCTATAGACA AAGTTATAGT GCTAATCAGT TAATACCAAG AAGTTCTCGT
TCTGTACAAA CATTTCGAGG CGAAGTTCCT CCAGCCAGTT TTGTGCAACT GCGAGTTGGT
AGTAGAATAA TTGCCCAACA GCAGGTAGGA TTTGATGGAA GATACGAATT TTTTGATGTA
AATTTGCCTG CTGGACAAAA TAATTTAATT GAAGTTTTGG TTTACGATCG CAACAATTTC
AGTGTACCTA TTGAAATTCG CTCTGTGAGA CTCAACACCT CAGATTTACT ATTACCTCCA
GGTGGTAATG TACAGTTATT GGGTTTGGGT TTTAGTGGTA ACTTAGCTCA AAATGCTTTT
TCTGACGATT ATAATACCTC TGATTCTGGT AGCCTTGTGG GCTTTTATCA ATTTAGGCAA
GGTCTATCAG ATAATTTTAC TGTTGAAGCT GGACTACAAG CAATTCCGAA TACTTTCCAA
ACTCAATTAG GTGCAATTTG GCGTATAGCC AATCCTGTGA TTTTATCGGG TAGCGTGGGT
ACATCTTTTG GTAAGCTTGC TTATAATGCA GATTTAGATG TTCAGTTAGG TGGATTAGAT
ATAAATGCTA ATTCTCAGCT ATTCCCCGAA GGTTATAGAA GTGATAATGG CTCAAGAGAA
AGATATAATC ATAGCTTAGA AGTAGGTTAT CGCTTCAATA GTAATTTGAG GTTGGGAGTT
TTAGCACGTA GTCGCAAATC TGATTCAAAC TCAACTGAAT ACATTGCACC TACTTTCTAT
TTCCGTCCCT CTAATAGTTT ATATTTTACA GGTAGACCTG ACACTGAAGG GCAATATCTG
TTTTATTCGG CATATCAACC TAATGCTTTA ACTCGATTAT CATTTAGTAG CTTTGGTGAT
AACTATGCTA CGGATTTCAG TTATAAATTA AACAATAATT ATCAGTTATC TTTGGGTAGT
GATTTCGGTG GTAATCTACC AGCTCGCTAC CTTGCTACAG TAAATTATTA TCCCAGAAAT
ATTAGAGGAT TTAGTTGGAG GTTAGGGCTT GCTTATCGGG ATGGCGATGT GGGGCCAATT
GTTGGTGCTA GCACCCAATT GATACCGGGT CTATTTGCTA GAGTCGAATA TCAAGCTATA
CCTTCCAGGG CTAGAAGTAA TTTTGGTGGA TGGGGAGATG ATCGCCTGAC AATTTCTCTA
GTATCCGATT TATCATTCTC TGATGGTAGA ATCTCCCCAG GTAACTTTAA CTCTTTTGGG
AAAGATAAGG GAGCGATTTC TGGACGTATT TTTGTCGAAG GAAGTAAAGA AAGCTTTGAT
TTAGGCGAGT CTAGTGTCCG AGTTACAGAC AAATACGGTA AGATTATCGG TGGTGCGAGA
ACCAATGCTC AGGGTAACTT CTTTGTCGGT AATTTGCCAG AGGGTGTTTA CATAGTTGAG
GTCGTTCCCA AAGACTTACC CATAGAACTT ACCCCCCTGA AAACTACCAC AATTGCCGAG
GTTGCGGGTG CGGCGGTAAC TAAGTTGAGT TTCCCTATGA GGGTTGAATA TGGCATTGCT
GGACGCATTA CAGATAGTGC AGGTCTGCCC GTACCGGGTT TAGCGGTAGA ACTTGTTAAT
GCTGAAGGTA AAAAAGTTGG AACAGGCGCA ACTGACGAAT TTGGTCTTTA TCGTGTAGAT
GGTCTTCCGG CTGGTCAATA CACATTACAG GTTCCCAATC AAAGCAACAT CAATCGTAGC
AATAGCCTGC CCAAACGAGC TATAACTATT GAGAACGATT TTGCCTACGA CCAAAATCTG
CAATTACCCA TTTCCGCAGC CACTAAAGAT GCACAGGAAG CACCAAAGTT ACCTAATCCT
TAA
 
Protein sequence
MFYDRPPISP PPITPHSLVS SIPTPQDNDH SQTADTFKYA IATPSELETF PNEFSTGVSP 
QLFTTEHFSE ELITQHIVSE LPVGNTSNNN VTNIPGISTV DNIKQAPKAE SSPPNLGKPE
KDNGVTAPVN TTDTITFTET SSSGIETLLL GVIINRREVG SLDVIRRGNT LLVPLENFAQ
LTGLTITTTD DTTEIKTPLG AVKLASSELE IIQGITHISD SLLQEKLNTT IELRSLEAAL
IVGLPWLSGS REARNAAIDL EPEVKAPLSG LSSFRQELNV VNDSGGTRFQ SYSLLGGRLA
GGTWRIRLNN NFENSPNLAE YFFYKRNGRF LYQLGRQQIS VHPLLNGQIL TGAQFGYTNL
PADRYRQSYS ANQLIPRSSR SVQTFRGEVP PASFVQLRVG SRIIAQQQVG FDGRYEFFDV
NLPAGQNNLI EVLVYDRNNF SVPIEIRSVR LNTSDLLLPP GGNVQLLGLG FSGNLAQNAF
SDDYNTSDSG SLVGFYQFRQ GLSDNFTVEA GLQAIPNTFQ TQLGAIWRIA NPVILSGSVG
TSFGKLAYNA DLDVQLGGLD INANSQLFPE GYRSDNGSRE RYNHSLEVGY RFNSNLRLGV
LARSRKSDSN STEYIAPTFY FRPSNSLYFT GRPDTEGQYL FYSAYQPNAL TRLSFSSFGD
NYATDFSYKL NNNYQLSLGS DFGGNLPARY LATVNYYPRN IRGFSWRLGL AYRDGDVGPI
VGASTQLIPG LFARVEYQAI PSRARSNFGG WGDDRLTISL VSDLSFSDGR ISPGNFNSFG
KDKGAISGRI FVEGSKESFD LGESSVRVTD KYGKIIGGAR TNAQGNFFVG NLPEGVYIVE
VVPKDLPIEL TPLKTTTIAE VAGAAVTKLS FPMRVEYGIA GRITDSAGLP VPGLAVELVN
AEGKKVGTGA TDEFGLYRVD GLPAGQYTLQ VPNQSNINRS NSLPKRAITI ENDFAYDQNL
QLPISAATKD AQEAPKLPNP