Gene Ava_0094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_0094 
Symbol 
ID3683447 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp126211 
End bp128727 
Gene Length2517 bp 
Protein Length838 aa 
Translation table11 
GC content48% 
IMG OID637715421 
Productsurface antigen variable number 
Protein accessionYP_320615 
Protein GI75906319 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4775] Outer membrane protein/protective antigen OMA87 
TIGRFAM ID[TIGR00992] chloroplast envelope protein translocase, IAP75 family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.39497 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.155801 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTTTAT CTCCTGTATT GGTGGCGGCT GTAGCAATCA CAGCACCTTT AAGTAGTTCT 
TTAACTGCAA ATGCCCAAAC TCCTACAAAT TCAGAACAGA CATTAGAGGC GATCGCCCCA
GCGACAAATC AGCAGTCAGA ACAAAATGTG AGTTTAGAAT CCCCAGAGGA TTTCACAACT
GTCAAGGCGC TGGCAGATAT GCAATCCCCA TCAGGGGAAT TATCGACAGA TAAGTCAACT
AAATCCGCAG CAACGACGAA AAAAGATGTG ATTGTACCCA CTGTAGAGAC ATTAGTGGCT
ACAAATCCAC CCATGCAAGA GAGCAACTCC GCCAGTAAGA TAGCGCCTGT AGAAATTGGC
AAACCAACAT CGAGCGCGGT TTTTAGGAGT CAGTATCAGA CAATAAATTA TGGTAATTAT
GGGCGATCGC TGACATCAGG TGTTAGCTCA TCACCATCAC CACAGAAAAA AGCCAATTTA
CCCTCCCCAG TTCCTAACCC CCTAGACACC GGAGCGACAA CAGCCAAGCA ACTTGTACAA
GCCCCAGAAC AACCTGCACC CCAACCAGAA GTAGCCCCCC CAAGCACCGA GGAACCAGCC
CCAGCCCCAG AAACTACTCC AGGGACAGAA AATTTCAACA CTCCTAACGC CACACCTGAA
ACCACAGAAC CCCGTGTATT AGTTTCAGAA GTCCTCGTGA GACCGCAATC AGGACAACTA
ACTCCCGAAT TAGAAGCCCA AGTTTACAAC GTAATTCGCA CTCAACCAGG ACGGACAACC
ACCCGTTCCC AGTTACAAGA AGATATTAAT GCCATCTTTG GCACAGGCTT TTTCTCCAAC
GTCCAAGCAT CACCAGAAGA CACACCATTA GGGGTGCGAG TCAGCTTTAT CGTCCAGCCC
AACCCCGTCT TAAGCAAAGT AGAAATTCAA GCCAATCCTG GTACTAACGT TCCCTCAGTA
CTGCCCCAGG CTACTGCTGA TGAAATTTTC CGCCCCCAGT ATGGCAAAAT TCTCAACTTG
CGGGATTTAC AAGAAGGGAT TAAAGAATTA ACCAAACGTT ATCAAGACCA AGGTTATGTT
CTCGCCAATG TTGTAGGCGC GCCCCAAGTT TCCGAAAATG GAGTTGTCAC CCTGCAAGTA
GCTGAAGGAG TTGTTGAGAA TATTAGCGTC CGCTTTCGCA ACAAAGAAGG ACAGGATGTC
AACGAACAAG GACAACCAAT TCGGGGACGG ACACAGGACT ATATCATCAC GCGAGAAGTG
GAATTGAAAC CAGGACAGGT ATTCAACCGC AACACCGTCC AGAAAGACCT ACAACGGGTA
TTCGGGACAG GATTGTTTGA AGATGTCAAC GTTTCCCTTG ACCCCGGTAC AGACCCCACC
AAGGTGAATG TGGTGGTAAA TGTGGTAGAA CGTAGCAGTG GTTCAATTGC TGCTGGTGCT
GGTATCAGTT CTTCTAGTGG GTTGTTTGGT ACAGTCAGCT ATCAACAGCA AAACCTTAAC
GGCAGAAACC AAAAACTAGG CGCAGAAGTA CAACTTGGCG AACGAGAATT GTTGTTTGAC
CTCCGGTTCA CAGACCCTTG GATTGGTGGT GATCCTTACC GTACCTCTTA CACAGCGAAT
ATTTTCCGCC GTCGTTCCAT CTCCTTGATT TTCGACGGGA AGGACGAAGA TATTAGAACA
TTTGACCCTG GTAATCCTAA TGATACAAAC GAGCAAGACC GTCCTCGTGT CACTCGTCTA
GGTGGTGGTG TTACCTTCAC CCGCCCTCTA TCAGCTAATC CCTTTGAAAG AGCAGAATGG
ACAGCCTCAG CAGGCTTGCA GTATCAGCGA GTTAGTACCC GTGATGCTGA TGGCAACTTG
AGAAAAGAAG GTGCTGTATT CGACGATAAT GGCAACCGTA CCAGTGAACT CGTTCCCCTC
AGCTTTTCTG GGACGGGAGA AGATGACTTA TTATTAGTGC AACTAGGCGC ACAGCGTGAC
CTCCGCAACA ATCCCTTGCA GCCCACTAGC GGTTCTTTCT TACGTTTCGG CGTAGATCAA
TCAGTACCAG TAGGCTCAGG TAATATTTTC CTCACTCGGT TCCGGGGTAG CTATAGTCAA
TATCTCCCCG TTAAATTTAC AGGTTTTAGT AAGGGGCCGG AAACTATAGC CTTTAACATC
CAAGGCGGCA CAGTCCTTGG TGATTTGCCC CCCTATGAAG CTTTTACCCT TGGTGGTAGT
AACTCAGTAC GTGGTTATGA AGAAGGTGCT TTAGGTAGCG GACGCAGCTT TGTGCAAGCA
TCTGTTGAGT ATCGTTTTCC TGTTTTCTCT GTAGTTAGTG GCGCTTTATT TTTTGACGTT
GGTAGCGACT TGGGAACCAG TACCAGAACT GCTGAAGTGC TGAATAAAAG CGGTAGCGGC
TACGGTTATG GTCTTGGTGT GCGCGTGCAA TCACCATTGG GGCCAATTCG TATTGACTAT
GGTATCAACG ATGACGGTGA TAGCCGCATC AATTTCGGTA TTGGCGAAAG GTTTTAG
 
Protein sequence
MRLSPVLVAA VAITAPLSSS LTANAQTPTN SEQTLEAIAP ATNQQSEQNV SLESPEDFTT 
VKALADMQSP SGELSTDKST KSAATTKKDV IVPTVETLVA TNPPMQESNS ASKIAPVEIG
KPTSSAVFRS QYQTINYGNY GRSLTSGVSS SPSPQKKANL PSPVPNPLDT GATTAKQLVQ
APEQPAPQPE VAPPSTEEPA PAPETTPGTE NFNTPNATPE TTEPRVLVSE VLVRPQSGQL
TPELEAQVYN VIRTQPGRTT TRSQLQEDIN AIFGTGFFSN VQASPEDTPL GVRVSFIVQP
NPVLSKVEIQ ANPGTNVPSV LPQATADEIF RPQYGKILNL RDLQEGIKEL TKRYQDQGYV
LANVVGAPQV SENGVVTLQV AEGVVENISV RFRNKEGQDV NEQGQPIRGR TQDYIITREV
ELKPGQVFNR NTVQKDLQRV FGTGLFEDVN VSLDPGTDPT KVNVVVNVVE RSSGSIAAGA
GISSSSGLFG TVSYQQQNLN GRNQKLGAEV QLGERELLFD LRFTDPWIGG DPYRTSYTAN
IFRRRSISLI FDGKDEDIRT FDPGNPNDTN EQDRPRVTRL GGGVTFTRPL SANPFERAEW
TASAGLQYQR VSTRDADGNL RKEGAVFDDN GNRTSELVPL SFSGTGEDDL LLVQLGAQRD
LRNNPLQPTS GSFLRFGVDQ SVPVGSGNIF LTRFRGSYSQ YLPVKFTGFS KGPETIAFNI
QGGTVLGDLP PYEAFTLGGS NSVRGYEEGA LGSGRSFVQA SVEYRFPVFS VVSGALFFDV
GSDLGTSTRT AEVLNKSGSG YGYGLGVRVQ SPLGPIRIDY GINDDGDSRI NFGIGERF