Gene Ava_1307 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_1307 
Symbol 
ID3683050 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp1606944 
End bp1608200 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content46% 
IMG OID637716645 
Productvon Willebrand factor, type A 
Protein accessionYP_321826 
Protein GI75907530 
COG category[R] General function prediction only 
COG ID[COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTTA ATTTGCAGCC TGTCCTCAAT GATGCCAATT TGGATGCACA ACAGCCAAGC 
AGCCAGCGTC AGTTGGCAAT TTCCATCTCA GCTGGTGCTG AACCTCAAGA CCGCACGGTA
CCGCTTAATT TATGTCTAAT TCTTGACCAT AGCGGCTCGA TGAATGGGCG ACCGCTAGAA
ATAGTCAAAC AAGCGGCAAT TCGTTTAGTT GACCGTCTGA AGACGGGCGA TCGCCTGAGT
GTGGTAGCTT TTGATCACCG TGCTAAGGTC TTAGTCCCTA ATCAAGTCAT CGACAACCCG
GAACAAATCA AAAAACAAAT CAATCGGCTG GCGGCGGATG GTGGGACAGC CATTGATGAG
GGTTTGCGTT TGGGAATTGA GGAATTAGCC AAGGGGAAAA AAGAAACGAT TTCTCAAGCT
TTTTTGTTAA CCGATGGGGA AAATGAACAC GGTGATAACA ATCGCTGCTT AAAATTCGCT
CAATTAGCGG CTGGTTATAA CTTGACTTTG AATACTCTGG GATTTGGCGA CAACTGGAAC
CAAGATGTTT TAGAAAAAAT TGCGGATGCA GGCTTGGGAA GCTTGTCTTA TATCCAGAAA
GCCGAACAAG CGGTAGATGA GTTTGGTCGT CTGTTTAGTC GGATACAAAC GGTGGGTTTG
ACAAATGCTT ATTTGCTATT GTCTCTTGCG CCTAATGTCC GCCTGGCAGA ACTGAAACCC
ATTGCCCAAG TTGCCCCAGA CACAATTGAG TTACCCCTGC AACAAGAAAC AGATGGCCGT
TTTGCTGTTC GCTTGGGTGA TTTGATGAAG GATGTAGAAA GGGTGATTTT GACTAATATT
TATCTGGGTC AGTTGCCAGA GGGTAAACAA CCCATTGCCA ATGTGCAGAT CCGTTATGAT
AACCCAGCAC AAGACCAAAC CGGGTTATTC ACTCCCAATA TCCCCGTTTA TGCCAATGTT
GTTAGGGCGT ACCAACCAGC TATTAATCCC CAGGTGCAAC AGTCGATTTT AGCTTTGGCT
AAGTATCGAC AAACCCAGCT AGCAGAAGCC AAATTGCAAC AAGGCGATCG CGTGGGTGCG
GCTACTATGT TGCAAACTGC TGCCAAAACT GCCCTGCAAA TGGGTGATAC AGGCGCGGCG
ACAGTTTTAC AAACCTCTGC AACTCAATTA CAGGCTGGGC AAGATTTATC AGAAAGCGAT
CGCAAGAAAA CCCGGATTGT CTCCAAAACC GTCTTGCAAG ATACCCCTCC CAAATGA
 
Protein sequence
MKVNLQPVLN DANLDAQQPS SQRQLAISIS AGAEPQDRTV PLNLCLILDH SGSMNGRPLE 
IVKQAAIRLV DRLKTGDRLS VVAFDHRAKV LVPNQVIDNP EQIKKQINRL AADGGTAIDE
GLRLGIEELA KGKKETISQA FLLTDGENEH GDNNRCLKFA QLAAGYNLTL NTLGFGDNWN
QDVLEKIADA GLGSLSYIQK AEQAVDEFGR LFSRIQTVGL TNAYLLLSLA PNVRLAELKP
IAQVAPDTIE LPLQQETDGR FAVRLGDLMK DVERVILTNI YLGQLPEGKQ PIANVQIRYD
NPAQDQTGLF TPNIPVYANV VRAYQPAINP QVQQSILALA KYRQTQLAEA KLQQGDRVGA
ATMLQTAAKT ALQMGDTGAA TVLQTSATQL QAGQDLSESD RKKTRIVSKT VLQDTPPK