Gene Ava_2811 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_2811 
Symbol 
ID3681647 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp3477221 
End bp3480643 
Gene Length3423 bp 
Protein Length1140 aa 
Translation table11 
GC content39% 
IMG OID637718157 
Productfilamentous haemagglutinin-like protein 
Protein accessionYP_323319 
Protein GI75909023 
COG category 
COG ID 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0282223 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.417754 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAAAC GAAAAAATCT AAATTGGGTT ATTAACTTAG GTTTATTATT TAGTTTGGTT 
GTGGCAACAA ATAACCGAAC TTCGGCACAA GTCACAGCCG ACCAAACCTT AGAAACACAA
GTTATAGATA TTGGTTTCAA CTCCTTTATT TTAGGTGGCA CAACAGTTGG CAACACTAAT
TTATTCCACA GTTTTGCTAG TTTTAATGTT CCTAGTAATG GTGCAGCAAT TTTTATCAAT
GACCCTAGTT TAACTAATAT ATTTGCACGA GTAACAGGTG GCACAGCTTC CGATGTTCAA
GGCAGAATAG GTACTCAAGG GACTGCCAAT TTATATTTAA TCAATCCCAA CGGCATTATT
TTTGGTACAA ATGCTAGCTT AAATATAGGT GGTTCTTTTG TCGCTACCAC AGCTAATGCT
ATTCAATTTC CTGGTGGTGC TGAATTTTCT TTAAATTCAT CAGTTTCACC AGATAATCAT
CTACTTAGGG TCAACCCCAC AGCATTCTTA TTTAATCAAA TCGCTAATCA AGGTATAAAC
TCCATTGAAA ATCGCGGTTC TTTAGGAGTT CCTAATAATA AGAGTTTAAT TCTACTTGGT
GGTAATATTG CACCCACATC TCATGCCACT GGGGGAATTT TGCTGGATAA TGGAAGACTA
GGAGCATTAG ATGGGAGAGT CGAAATTGGC GGATTAACTG CACCTGGCTC CGTAGGAATA
AATGTTAAGG GTAATAATTT AAGTTTAAGC TTTCCGCAGG GAGTAGCTAG AACAGATATT
GCATTAATAA ATAATACTGG AGTTCTTGTT GCTGGTGTTC GTGGTGGAGA TGTTTTAGTC
AACGCTAATA ATTTAAATTT ACTAAATTCT ACTAATATTT TTACTGGCAT ATTTGCTAAT
CAAGGAAAAC CTGATAGTAA AACTGGGGAT ATATCCATTA ATGCAACGGG AATTGTCAAC
CTAACTGAAG GTAGTGGTAT TGGAAATTTA TCACTAGGGA TAGGAGATAG CGGCAATATC
AATATTGTTG CTCAATCACT AAATATGAAG AATGGTAGCA GCATTCAAGC GATTACTGCA
CAGGGTGAAT CAGGAATTGT TAATATTAAA GTTACTGATA CACTTTCCCT AAACCAGGGA
CTCCTTGCTA ATGTAATTTT AAGTGATAGG ATTAATATCC AAGCACAAAA AGTTTCCCTT
GCAAATGGCG CTTTAATCCG TGCTTTTAAT TCATCAGGAA ATGATTCTGT AGGTATTAAA
ATTCAAGCTT TTGATTCCCT TGAATTGAAC AATGCTAGCA TTCTTACTGG TACTTTTGAA
AACGGTAATT CAGGTGATTT ATCAATTATT ACCAATCGCT TAAATCTAAA AAATGATTCT
CAAATAGCAG CTAATACATT TGGCAGTGGC AATGCTGGCC AAATTAATAT CAATGCTGCT
GATATCGCTA TAGACTTATC TTCAATTAGT GCGGCTTCAT TATCTTTCCC AGGAGTTATT
ACTAATATAG GTAATGCTGG GACTATCAAT ATCCAAACGG AAAGAATTAG TCTCACAAAT
CGTGGTTTCA TATCATCTAG TTCTGACGAA CCAGAGCAGG GAACAACAAT CGGTTATGGC
GGAAATATTA ATATCACAGC TAATGAATTA ATAGAAATAG ATGCAAAAGG TGCAACAGGT
ATAATTACGG GTTTAGTTGC CAGAACTTTT AGTGGCAGCA GTGCTGGTGA CATAAACCTG
AATACCAGAA ACCTGATTGT TAGAGATGGT GGTTATATTA GCGTAGAAGC TGCTAATAAT
TTAGGGGGAA ATGCTGGTAA TATTACTATC AACGCATCAG ATGCTGTTGA GTTAACTAGC
TCCAACGGCA ACCGCCCTAG TATAATTTTC ACTGGTGTTT CAGCTTTTAA TGATAATGCG
ATCGCTGTAG GTAACGGTGG TGACTTAAAA ATTATTACTC AAAATTTACA GCTAACAAAC
TCCACTATAT CAGCCGGGAC TTTCGGTCAA GGTAATGCAG GTAATATCAA TATCTTCAGC
AATGATGGGG TGGTAATCGA TAATAGTGTG ATCAATAGCC GAATTGGCGC TGGTGCAGTT
GGGAATGGTG GTGATATTGA CATCCAGACA CAGAGGCTAA CTTTAACTAA TGGTGGTGGG
ATTGATACTA TAGTTGCCCC ACCAGAAGGA AATTTACCTG GCGGACTAGG TAAAGGTGGA
ACTATTCGTA TCAATGCTAC AGATGCAGTC ACTATTTCAG GAACTAATGC AGATGGTTTT
GAGAGTGCAA TCTTGACGGA AACCCAAAGC GGTGCATTTG GTCAATCTGG CGATATCATC
ATTAATACAA ATTATTTCCG TGTCTTAGAC AACGGTAGTG TATCGACATT AACAGGAAAT
AGTAGTGATG CAGGTAATAT TTTGATCAAT ACCCGCGTAA TGGAGGTGTT GAATGGTGGA
GTAATCGTGA CAGGCACTGA GGCTAGTGGT AAAGCCGGTG ACATTACCAT TAATGCTACA
GATCAAATTA ACATCTTGAG CAACACCGAC TTAAATGGTA ATACTGGTTT GGTTGCTAGC
ACTACTTCTA GTGGAGATGC GGGGAATATA TCCCTATTTA CAACTGACCT TAACTTATTA
GCTCCTAATT CTAACTTGCC TGCTCCCGTC CTTGTGGCCA CATTAAGTCG AGGAAGCGGG
ATTGCCGGGA ATATTAATAT TGTTGCCACA GGCAATTATA ATGCCAATAA TGGCTTAATC
AGCGCACGAT CTGAAAGAGC CGGTGGTGGA AATATCAGTG TCAATGCTAG AAATATCAAC
CTACGGAATA ACAGCGACAT TCGTACAGAT TTATCTAGTG GTAACGGTAG AGGTGGGGGA
ATTTCCCTCA CCGCAGATAT TATCATTTCC TTGGAGGATA GCGATATTCT TGCCTTTGCA
CCAGAGGGAC AAGGCGGAGA TATTAAATTT AACACCCGCG CTGTATTTAG CAATGTTCTC
TACACGGCTA GACAAACAAT TCCTGACAGA AATAGTCTTC AGTCACTAGT TAATAATACT
CGCCCTGACA TTAACGCTAC AGGGACAATC TCTGGCAACA TTATTGGTGT ACCTGATATT
AGCTCTATCC AAAACGGACT CACAGATTTA CAAGCTAATC CTATTGATAC TACCGTACTG
ATTGCTAATA GTTGCATTGC TCGCAGTCCT AGACAAGAAG GAACTTTTAT TATTACCGGG
ACTGGTGGTT TACCAACTCG TCCTGGTGAG CTTAAGGCTT CTAGTTATGC CACAGGAGAT
GTGCAGAGTG TCAGTAATAA TATTGTAGCT AGTGCATGGA AGAAAGGCGA CGCTATTATT
GAACCACAAG GAGTATATCG ACTGGCGAAC GGGGAGATGG TGATGAGTCG TGAGTGTGGT
TGA
 
Protein sequence
MIKRKNLNWV INLGLLFSLV VATNNRTSAQ VTADQTLETQ VIDIGFNSFI LGGTTVGNTN 
LFHSFASFNV PSNGAAIFIN DPSLTNIFAR VTGGTASDVQ GRIGTQGTAN LYLINPNGII
FGTNASLNIG GSFVATTANA IQFPGGAEFS LNSSVSPDNH LLRVNPTAFL FNQIANQGIN
SIENRGSLGV PNNKSLILLG GNIAPTSHAT GGILLDNGRL GALDGRVEIG GLTAPGSVGI
NVKGNNLSLS FPQGVARTDI ALINNTGVLV AGVRGGDVLV NANNLNLLNS TNIFTGIFAN
QGKPDSKTGD ISINATGIVN LTEGSGIGNL SLGIGDSGNI NIVAQSLNMK NGSSIQAITA
QGESGIVNIK VTDTLSLNQG LLANVILSDR INIQAQKVSL ANGALIRAFN SSGNDSVGIK
IQAFDSLELN NASILTGTFE NGNSGDLSII TNRLNLKNDS QIAANTFGSG NAGQININAA
DIAIDLSSIS AASLSFPGVI TNIGNAGTIN IQTERISLTN RGFISSSSDE PEQGTTIGYG
GNINITANEL IEIDAKGATG IITGLVARTF SGSSAGDINL NTRNLIVRDG GYISVEAANN
LGGNAGNITI NASDAVELTS SNGNRPSIIF TGVSAFNDNA IAVGNGGDLK IITQNLQLTN
STISAGTFGQ GNAGNINIFS NDGVVIDNSV INSRIGAGAV GNGGDIDIQT QRLTLTNGGG
IDTIVAPPEG NLPGGLGKGG TIRINATDAV TISGTNADGF ESAILTETQS GAFGQSGDII
INTNYFRVLD NGSVSTLTGN SSDAGNILIN TRVMEVLNGG VIVTGTEASG KAGDITINAT
DQINILSNTD LNGNTGLVAS TTSSGDAGNI SLFTTDLNLL APNSNLPAPV LVATLSRGSG
IAGNINIVAT GNYNANNGLI SARSERAGGG NISVNARNIN LRNNSDIRTD LSSGNGRGGG
ISLTADIIIS LEDSDILAFA PEGQGGDIKF NTRAVFSNVL YTARQTIPDR NSLQSLVNNT
RPDINATGTI SGNIIGVPDI SSIQNGLTDL QANPIDTTVL IANSCIARSP RQEGTFIITG
TGGLPTRPGE LKASSYATGD VQSVSNNIVA SAWKKGDAII EPQGVYRLAN GEMVMSRECG