Gene Information |
Locus tag | Ava_0053 |
Symbol | |
ID | 3683562 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 61940 |
End bp | 64375 |
Gene Length | 2436 bp |
Protein Length | 811 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 637715380 |
Product | filamentous haemagglutinin-like protein |
Protein accession | YP_320574 |
Protein GI | 75906278 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3210] Large exoproteins involved in heme utilization or adhesion |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
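
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that Protein Length excludes the stop codon. The record does not state these conventions, but the numbers agree with them. The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the record above.
START_BP = 61940
END_BP = 64375
GENE_LENGTH_BP = 2436
PROTEIN_LENGTH_AA = 811


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete, unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length_bp // 3 - 1


# 64375 - 61940 + 1 = 2436 bp; 2436 / 3 = 812 codons, one of which is the stop.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Because the record marks the gene as unspliced and not a pseudogene, the simple "codons minus one" rule is enough here. A spliced or frameshifted gene would need a different check.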
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.710253 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGTCAAAT CTAAGTTTTT AATCTGGAAT ATCAGTATTT TATTGTGTTT ATTAAATTTC ACCGCCGCAA AAGCAGAAAT CATTCCCGAT ACTACACTTC CTAATAACTC TACTGTTAGA CCAGTGAGGA ATATCCGAGT AATTGAGGGA GGTACTCAAA GAGGAGAAAA TTTATTTCAT AGCTTTAGGG AATTTTCTTT TTCTGCCTTA ACTACTAATG TTACAGGAAA TGTTGCTATT TTTAATCACA ATTTGGCAGT ACGAAATATT ATTACTAGAA TTACAGGTGG CTCACCTAGT TATATTGATG GTTTGATTAC AGCTACCTCA GGTAGTAGAG CTAATTTGTT TTTGATTAAT CCTCAGGGCA TAATTTTTGG GGCAAATGCT AGTTTAAATA TTGGCGGTTC TTTTGTGGCG ACTACTGCAA ATAGTATCAA GTTTGCTGAT GGGATAGAAT TTAACGCAAC TACTAATAAT AATACCTCTT TATTAACAGT AAATGTTCCT GTAGGGTTGC AATTTGGTGA TAATCCGGGA AATATTGAAC AGCCAATGGT TGCTAATTTA CCTTTGGAAT TGCAAGTAAG AGATGGCAGA ACTTTAGGAT TAATCGGTGG GAATTTATTA TTAGAAAGTA GTCTTTTGGA AGCGCCAGGG GGCAGGGTTG AGCTAGGTAG TGTTAGTAGA AATGGCTTTG TCAGCCTCAG CCAAATTGAT GATGCTTACG TTGCTGGATA TTCAGGTGTA CAAGATTTTG GAGATATCAA TCTTGCGGCT GGAACTTTAG TTAATAGTGG TAGTGTACCC ATGCAAGATA GTGGTGGTGC AATTCAAATT CAAGGAAGAA ATGTCACTAT TGATGATTCA CTAGTTTTTA CGGTTAACTC TGGTTCGCAA ACAGGAAATA ACCTCGTTGT TAATGCTTCT GAATCTTTAA AAGTAGGTGG CACTTCTAAT ATCTTGACTA TTGCTCAGGC TGAAGGTAAG GCGGGAGATA TTTGGATTAC AGCTAAAGAT TCAGTAGAAT TAAGGGAAGA ATCCTTCATT GGTTCACAAG TCTGTTCTTT AGGTGGAAAT TGTGCCAATG TGACAGGTAA TGGTGGCAAT TTGACCATTG AAACCGGGCG ACTTTTGCTA ACAGATGGTG CGGGAATAGA AGCTTCTACG TTTGGTGCAG GAAACACGGG AAATATTTTA GTTAGAGCTA CAGATTCTAT AGATTTAAGG GGAGAAAGTC CCGATGGTGA TATTCCCAGT GGAATTTTCG CGCAAGTTTC CCCCGATGCT CCCGGAAATG CCGGTAATAC TGGCACAATT ACCCTACAAA CTAGACGGTT GAATATTCAA GGTGGCGCAC AAATATCTAA TGTTTCTAGG TATGGAAGTC AGGGGGGTGA TGTCTTGATT AATGCCTCAG CAGGTATTCA AGTGAGTGGT GCTTCCCAAT TTACTACAGC CTCATCTTTA GATATTTTTC GTAGTGGGAT TTTTGCTGCA ACTCAAGCAG GAACTACCAC CAATGACAGC ACATTAAATA TCAATACTGG ATTACTGACT GTAGAAAACG GAGCCAGGCT AACAGCAAAT AACTTAAATA TTGTAGGCGA TCGCCTAATT GTCCAAAATG GGGCAGAAGT TACCGTGGCT GGAGAGTCGG GGAATTTATC TATCAAGGCG CGATCGCTTA AACTAGATAA CCAAAGTCAA TTAATTGCTC AAACTACCTC CGGTAATGGT GGCAATATCA CTCTCAATCT CAGCGCTATA TTGCGACTAC GCCTGAATAG TCAAATTTCT 
ACATTTGCTG ATGGTAATGG TGGCAACATC GAAATTAATA GTCCTTTCAT TGTGGCAACT CCCAATGAAA ATAACGACAT CGTTGCCAAT GTGTTTGGGG GTGCAGGGGG TCGGGTGACA ATCAGTACTC AGAAATTTCT TGGTTTAGTT GTGCGTAATC GAGGCGAACT TGAGCAAATT TTAGGGACTA CTGACCCAAA TCAGCTTGAC TCAGGATTTC TACTAACCAA TGACATTGTT ACTTTCTCAC AGACCAGCCC ATCTTTAATA AGTACGGCCA CAATTAACAG GCCGGATGTA GACTCGCGTC AGAGATTTGC TGCGTTCCCC ACAAATCCCA TCAATACATC CAGGCTAACC CAGGTTTGTA GTTCAGATAG TAGGAAAAAA CCACGACTGA AAGACAGTTT TCAAGGAGAA CGGGAAACCA AAAATTCCCC CAGTCCTCAC CATTTGTTAA CAATCAGTAA TTTTACCCCC ATTGATGTTG TGGATAACCC TGATTCCCCT AAAATTGTGG AAGCACAGGG ATGGGTGATC GATACTGATG GCAATATTAC CTTAGTTGCT CCAGTTCCTA CTGTTAATCA TCACTATCCC TGGTTTTCGT CCACCATCTG CCATATTCAT GAATAA
Protein sequence | MVKSKFLIWN ISILLCLLNF TAAKAEIIPD TTLPNNSTVR PVRNIRVIEG GTQRGENLFH SFREFSFSAL TTNVTGNVAI FNHNLAVRNI ITRITGGSPS YIDGLITATS GSRANLFLIN PQGIIFGANA SLNIGGSFVA TTANSIKFAD GIEFNATTNN NTSLLTVNVP VGLQFGDNPG NIEQPMVANL PLELQVRDGR TLGLIGGNLL LESSLLEAPG GRVELGSVSR NGFVSLSQID DAYVAGYSGV QDFGDINLAA GTLVNSGSVP MQDSGGAIQI QGRNVTIDDS LVFTVNSGSQ TGNNLVVNAS ESLKVGGTSN ILTIAQAEGK AGDIWITAKD SVELREESFI GSQVCSLGGN CANVTGNGGN LTIETGRLLL TDGAGIEAST FGAGNTGNIL VRATDSIDLR GESPDGDIPS GIFAQVSPDA PGNAGNTGTI TLQTRRLNIQ GGAQISNVSR YGSQGGDVLI NASAGIQVSG ASQFTTASSL DIFRSGIFAA TQAGTTTNDS TLNINTGLLT VENGARLTAN NLNIVGDRLI VQNGAEVTVA GESGNLSIKA RSLKLDNQSQ LIAQTTSGNG GNITLNLSAI LRLRLNSQIS TFADGNGGNI EINSPFIVAT PNENNDIVAN VFGGAGGRVT ISTQKFLGLV VRNRGELEQI LGTTDPNQLD SGFLLTNDIV TFSQTSPSLI STATINRPDV DSRQRFAAFP TNPINTSRLT QVCSSDSRKK PRLKDSFQGE RETKNSPSPH HLLTISNFTP IDVVDNPDSP KIVEAQGWVI DTDGNITLVA PVPTVNHHYP WFSSTICHIH E
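
The relationship between the two sequence fields can be illustrated by translating part of the gene sequence. This is a minimal sketch, not the database's own pipeline. It builds the codon table from the standard NCBI amino-acid string, which translation table 11 shares with table 1 (the two differ only in permitted start codons). The helper names `clean` and `translate` are hypothetical. Only the first 30 bases of the gene are used, copied from the Gene sequence field.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... as used in NCBI genetic code listings.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the Gene sequence field and the first block of the
# Protein sequence field, copied from the record.
gene_prefix = "ATGGTCAAAT CTAAGTTTTT AATCTGGAAT"
protein_prefix = "MVKSKFLIWN"

assert translate(gene_prefix) == protein_prefix
print(translate(gene_prefix))
```

Passing the full Gene sequence field to `translate` in the same way should give the full Protein sequence field once the grouping spaces are removed from both.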