Gene Ava_1359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_1359 
Symbol 
ID3682868 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp1675409 
End bp1677121 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content43% 
IMG OID637716697 
Productvon Willebrand factor, type A 
Protein accessionYP_321878 
Protein GI75907582 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1240] Mg-chelatase subunit ChlD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00054034 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0176573 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCTCA AAGGGTCTTG GTTAAATACA CGCCTTGTAG CAGTTTTAGG TGTCCTGTTA 
CTAACCGCTT GTAGCAGTAA CCCCAACTCA ACAGATAATT TTACGGGTTT AAAGATCAAA
GTTTTAGTTG GCAGCGCCTT GGGGGATTTT TGTAACCAAG CTGCAAAAAA TTTTAATGCA
ACGCAACCTA AGTTAGATAA TGGTAATGCT TTACGGGTGG AATGTGAGGC GCAAGGTAGC
GGTGATGTTG TTACTAAGTT GCTAGGATTG ACCACTCAAC TAAAAAACGG CACTTTACAA
CCTGATGGGG CAGATTTTCC CACAATAATT TCCCTGGATG GAGATATATA TCACAGTCAG
TTAATTTACC GAATCAACCA AGTTTTTCCG GGGCAAAATT ACATTCCGGA AATTACCGAT
GCGCCATTGC TGGCTAATAG TCCAATGGTA TTTATGGCAC AGGCGGATGT GGCTGGCGGT
TTGCAGAAAG TACCTGATGC TTATAAGGCT TTAGTGACAG CGAAAACTCA CCGCGATATA
GACCCTGCTT CACCATCGTT AACAGTTAAT TACGTCCACA CTGCACCGAC TCGTTCTAAT
TCGGGGTTGC AAACTTTAGT AGCTCAGTAT ACTAGTGTGT CTGGAAAGCG TCCTGAAGAA
TTAACCATTG CTGATGTGCA GACTTTTCAG CCGCAAATTC AGCAAATCCA AAGTAAGATT
ACTCGTTACG GTGTTTCTAC TAATTCTCTG GCTCAAGCGA TGGTGAAAAA CGGGCCGTTT
TGGGCTTCTG TGGGGTCTGT GTATGAATCG AGTGTGATTG CTGCAAATTC CAGCTTGCAA
CCAGGACAGG AGCGTTATCA GGCAGTGTAC CCCAAGACAA CGTTTACTTC TAATATGCGA
GCAATTGTGC CGAATGCGCC TTGGGTGAGT GCTGATGAGA AGGCTGGTGC AGAGAAGTTT
ATCACTTATT GGCGATCGCC TGATACTCAG AAAATTGCCC CAGATTTAGG TCTGCGACCA
GGAACCCCAG GAGTAGCTTT AGGTGCAAAG TTCTCTCCTG AGTTTGGTGT TGTAGCACAA
GCTAAGTACG ATTCTTTGCG TCCACCAAAA CCAGAGGTAG TAGATGCAAT GTTGAAATCT
TGGCAGGAGG CTTCTAAAAA ACCATCTTTG GTGGTGGTTG TGGTGGATTC TTCAGGGTCA
ATGGAGGGTA ATAAGTTACC AGCCGTCCAA AATACTTTGC AAAATTATAT TAAGAATTTG
GGCAAAAAAG AACAAATTGC TTTGATAGAT TTTGACTCAG AAATTAGAGA GCCTGTCTTA
GTAGATGGTA CTCCCCAAGG ACGCGATCGC GGTGTGCAGT TTATTAGCGG TCTTCGGGCT
GACGGCGGGA CAAAGTTATA TGATGCTGCT ATCCAAGCGC GGAATTGGTT ACAAAAAAAT
CGTCGTCAAG GGGCGATTAA TGCAGTTTTA ATATTAACTG ATGGGGAAGA TTCTGGTTCA
AAAATATCTT TGGACAATCT ATCAGCAGAG TTGCAAAAAA GTGGTTTTTC TACTGACCAA
AGAATTGGCT TTTTTACAGT TGGTTATGGT GAGGAAGGGG AGTTTAATCC TGATGCTTTA
AAGAAGATTG CTGAGTTGAA TGGAGGTTAT TATTCTAAAG GTGATCCTGA GACGATTTCG
CGGTTGATGT CTGATTTACA GGTGGAGTTT TAA
 
Protein sequence
MILKGSWLNT RLVAVLGVLL LTACSSNPNS TDNFTGLKIK VLVGSALGDF CNQAAKNFNA 
TQPKLDNGNA LRVECEAQGS GDVVTKLLGL TTQLKNGTLQ PDGADFPTII SLDGDIYHSQ
LIYRINQVFP GQNYIPEITD APLLANSPMV FMAQADVAGG LQKVPDAYKA LVTAKTHRDI
DPASPSLTVN YVHTAPTRSN SGLQTLVAQY TSVSGKRPEE LTIADVQTFQ PQIQQIQSKI
TRYGVSTNSL AQAMVKNGPF WASVGSVYES SVIAANSSLQ PGQERYQAVY PKTTFTSNMR
AIVPNAPWVS ADEKAGAEKF ITYWRSPDTQ KIAPDLGLRP GTPGVALGAK FSPEFGVVAQ
AKYDSLRPPK PEVVDAMLKS WQEASKKPSL VVVVVDSSGS MEGNKLPAVQ NTLQNYIKNL
GKKEQIALID FDSEIREPVL VDGTPQGRDR GVQFISGLRA DGGTKLYDAA IQARNWLQKN
RRQGAINAVL ILTDGEDSGS KISLDNLSAE LQKSGFSTDQ RIGFFTVGYG EEGEFNPDAL
KKIAELNGGY YSKGDPETIS RLMSDLQVEF