Gene Ava_0053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_0053 
Symbol 
ID3683562 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp61940 
End bp64375 
Gene Length2436 bp 
Protein Length811 aa 
Translation table11 
GC content40% 
IMG OID637715380 
Productfilamentous haemagglutinin-like protein 
Protein accessionYP_320574 
Protein GI75906278 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3210] Large exoproteins involved in heme utilization or adhesion 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.710253 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCAAAT CTAAGTTTTT AATCTGGAAT ATCAGTATTT TATTGTGTTT ATTAAATTTC 
ACCGCCGCAA AAGCAGAAAT CATTCCCGAT ACTACACTTC CTAATAACTC TACTGTTAGA
CCAGTGAGGA ATATCCGAGT AATTGAGGGA GGTACTCAAA GAGGAGAAAA TTTATTTCAT
AGCTTTAGGG AATTTTCTTT TTCTGCCTTA ACTACTAATG TTACAGGAAA TGTTGCTATT
TTTAATCACA ATTTGGCAGT ACGAAATATT ATTACTAGAA TTACAGGTGG CTCACCTAGT
TATATTGATG GTTTGATTAC AGCTACCTCA GGTAGTAGAG CTAATTTGTT TTTGATTAAT
CCTCAGGGCA TAATTTTTGG GGCAAATGCT AGTTTAAATA TTGGCGGTTC TTTTGTGGCG
ACTACTGCAA ATAGTATCAA GTTTGCTGAT GGGATAGAAT TTAACGCAAC TACTAATAAT
AATACCTCTT TATTAACAGT AAATGTTCCT GTAGGGTTGC AATTTGGTGA TAATCCGGGA
AATATTGAAC AGCCAATGGT TGCTAATTTA CCTTTGGAAT TGCAAGTAAG AGATGGCAGA
ACTTTAGGAT TAATCGGTGG GAATTTATTA TTAGAAAGTA GTCTTTTGGA AGCGCCAGGG
GGCAGGGTTG AGCTAGGTAG TGTTAGTAGA AATGGCTTTG TCAGCCTCAG CCAAATTGAT
GATGCTTACG TTGCTGGATA TTCAGGTGTA CAAGATTTTG GAGATATCAA TCTTGCGGCT
GGAACTTTAG TTAATAGTGG TAGTGTACCC ATGCAAGATA GTGGTGGTGC AATTCAAATT
CAAGGAAGAA ATGTCACTAT TGATGATTCA CTAGTTTTTA CGGTTAACTC TGGTTCGCAA
ACAGGAAATA ACCTCGTTGT TAATGCTTCT GAATCTTTAA AAGTAGGTGG CACTTCTAAT
ATCTTGACTA TTGCTCAGGC TGAAGGTAAG GCGGGAGATA TTTGGATTAC AGCTAAAGAT
TCAGTAGAAT TAAGGGAAGA ATCCTTCATT GGTTCACAAG TCTGTTCTTT AGGTGGAAAT
TGTGCCAATG TGACAGGTAA TGGTGGCAAT TTGACCATTG AAACCGGGCG ACTTTTGCTA
ACAGATGGTG CGGGAATAGA AGCTTCTACG TTTGGTGCAG GAAACACGGG AAATATTTTA
GTTAGAGCTA CAGATTCTAT AGATTTAAGG GGAGAAAGTC CCGATGGTGA TATTCCCAGT
GGAATTTTCG CGCAAGTTTC CCCCGATGCT CCCGGAAATG CCGGTAATAC TGGCACAATT
ACCCTACAAA CTAGACGGTT GAATATTCAA GGTGGCGCAC AAATATCTAA TGTTTCTAGG
TATGGAAGTC AGGGGGGTGA TGTCTTGATT AATGCCTCAG CAGGTATTCA AGTGAGTGGT
GCTTCCCAAT TTACTACAGC CTCATCTTTA GATATTTTTC GTAGTGGGAT TTTTGCTGCA
ACTCAAGCAG GAACTACCAC CAATGACAGC ACATTAAATA TCAATACTGG ATTACTGACT
GTAGAAAACG GAGCCAGGCT AACAGCAAAT AACTTAAATA TTGTAGGCGA TCGCCTAATT
GTCCAAAATG GGGCAGAAGT TACCGTGGCT GGAGAGTCGG GGAATTTATC TATCAAGGCG
CGATCGCTTA AACTAGATAA CCAAAGTCAA TTAATTGCTC AAACTACCTC CGGTAATGGT
GGCAATATCA CTCTCAATCT CAGCGCTATA TTGCGACTAC GCCTGAATAG TCAAATTTCT
ACATTTGCTG ATGGTAATGG TGGCAACATC GAAATTAATA GTCCTTTCAT TGTGGCAACT
CCCAATGAAA ATAACGACAT CGTTGCCAAT GTGTTTGGGG GTGCAGGGGG TCGGGTGACA
ATCAGTACTC AGAAATTTCT TGGTTTAGTT GTGCGTAATC GAGGCGAACT TGAGCAAATT
TTAGGGACTA CTGACCCAAA TCAGCTTGAC TCAGGATTTC TACTAACCAA TGACATTGTT
ACTTTCTCAC AGACCAGCCC ATCTTTAATA AGTACGGCCA CAATTAACAG GCCGGATGTA
GACTCGCGTC AGAGATTTGC TGCGTTCCCC ACAAATCCCA TCAATACATC CAGGCTAACC
CAGGTTTGTA GTTCAGATAG TAGGAAAAAA CCACGACTGA AAGACAGTTT TCAAGGAGAA
CGGGAAACCA AAAATTCCCC CAGTCCTCAC CATTTGTTAA CAATCAGTAA TTTTACCCCC
ATTGATGTTG TGGATAACCC TGATTCCCCT AAAATTGTGG AAGCACAGGG ATGGGTGATC
GATACTGATG GCAATATTAC CTTAGTTGCT CCAGTTCCTA CTGTTAATCA TCACTATCCC
TGGTTTTCGT CCACCATCTG CCATATTCAT GAATAA
 
Protein sequence
MVKSKFLIWN ISILLCLLNF TAAKAEIIPD TTLPNNSTVR PVRNIRVIEG GTQRGENLFH 
SFREFSFSAL TTNVTGNVAI FNHNLAVRNI ITRITGGSPS YIDGLITATS GSRANLFLIN
PQGIIFGANA SLNIGGSFVA TTANSIKFAD GIEFNATTNN NTSLLTVNVP VGLQFGDNPG
NIEQPMVANL PLELQVRDGR TLGLIGGNLL LESSLLEAPG GRVELGSVSR NGFVSLSQID
DAYVAGYSGV QDFGDINLAA GTLVNSGSVP MQDSGGAIQI QGRNVTIDDS LVFTVNSGSQ
TGNNLVVNAS ESLKVGGTSN ILTIAQAEGK AGDIWITAKD SVELREESFI GSQVCSLGGN
CANVTGNGGN LTIETGRLLL TDGAGIEAST FGAGNTGNIL VRATDSIDLR GESPDGDIPS
GIFAQVSPDA PGNAGNTGTI TLQTRRLNIQ GGAQISNVSR YGSQGGDVLI NASAGIQVSG
ASQFTTASSL DIFRSGIFAA TQAGTTTNDS TLNINTGLLT VENGARLTAN NLNIVGDRLI
VQNGAEVTVA GESGNLSIKA RSLKLDNQSQ LIAQTTSGNG GNITLNLSAI LRLRLNSQIS
TFADGNGGNI EINSPFIVAT PNENNDIVAN VFGGAGGRVT ISTQKFLGLV VRNRGELEQI
LGTTDPNQLD SGFLLTNDIV TFSQTSPSLI STATINRPDV DSRQRFAAFP TNPINTSRLT
QVCSSDSRKK PRLKDSFQGE RETKNSPSPH HLLTISNFTP IDVVDNPDSP KIVEAQGWVI
DTDGNITLVA PVPTVNHHYP WFSSTICHIH E