Gene Ava_2908 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_2908 
Symbol 
ID3681413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp3613066 
End bp3615249 
Gene Length2184 bp 
Protein Length727 aa 
Translation table11 
GC content45% 
IMG OID637718253 
Productlipopolysaccharide biosynthesis 
Protein accessionYP_323414 
Protein GI75909118 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID[TIGR01007] capsular exopolysaccharide family 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000198552 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCCCAC CAATTGTTAA GAGCTATATT ATTGCTTTTG AAAAATATAA ATGGATTGGA 
TTAGCCAGCT TTGCTTTAGT TGTAGCGGGG TCAACAGTGG TGGCTATCCA ACCAGAACCA
GCACCTACTT ATACAGCAAG CGGCACACTG ACATATTCTC GACCCCCGGT ATCATTTTCT
ACGACAGGTA GCGAAATTCA GCAACAAGGT CAGGAATTAA ACCAGGAAGT TCTGCTATCA
GATCAAGTAA TTGATAATGT GTCGGCAAAG GTGAAGATTC CGTCAAGAAG AATTGCTTCC
AGCGTAGCGA TTACACCGCC GAGAAGAAAC TCACGTACTG GAGAATTAGA ATCTAATGTC
ATTTCTCTGG CATATAAAGA TAGTGATCCC AAGCGGGCGC AGGAAATATT GCTGGAGCTG
ATGCAGGCAA TGGTCAAGCT CAGTAGTGAT ATTAATACTG GGCGACTAAA AGCTATTATT
GGCAAAATCA ATGAACGGAT ACCACAAGCA AGGTCTGAGC TACAAGCAGC CGAAAAAAGA
TTGGAACAGT ACGATCGCCG GGAAAGACCT GCTATATTAG CCGCAGAAAA TGGTAGTTTA
CTTGGTGCTG TTACTGGTAG CCAAAATCAG CAACGGGTAA TTCGCCAAAC TATTACGGGA
ATTGAGGCGC AGATTTTCAG CTTGCAAGAT AAGTTGGGTT TAGATGTCGG TCAATCTTAT
GTTTCTTCGG CGTTGAGTGC TGACCCAATT ATCGCCAACT TGCGATCGCA AATTTATCAG
ACAGAATCAC AAATCGCTTT ACTTCGTAAA GATTTGCGCC CAGAACACCC AAATATGATT
CAGTTGCAGC GTCAAAAACA AGCATCTGAA GAATTACTCC AACAAAGGGC GGCGGAAGTA
ATTGGCGGTG GTGGTACAGC AGCACCCTTA GCAACAAATA TTAGTGGTAT CCGCGCCCAA
AGTAGTTTAG ATCCAGCCCG ACAGCAGCTA GCTAACCAGA TGGTAGCTTT GCAAACGCAA
AAAGAAACCC TACAACAACA ATTAGCCCAG CAAATACGAG ATGAAATCCG GCTGCGGCAA
GAATATTCCC AAATACCCAA CAAACAATTA GAGCGATCGC GCTTAGAACA AGCAGTAGGG
CTGAAGAAAG CAGTTTATGA TCAAATGCAA GCCAAGCTCA CCGATGCTCA AACAGCCGAA
GCAGAGACAG TAAGCAGCTT TAGCATTGCT CAACCCCCCG TGGTAGCGGC TGATGCCAAA
AAACCCAAAA GTGTACCTTT AACCTTGGGG GTAGGTGGTT TCTTAGGATT GATAGTCGGT
GGTGGGGTGA TTTTCTTACT AGGTTCCTTG GAAGGAACAC TCCGCACCAG AGAAGCTATC
AGAGACAGCC TCAAACAACG GGATGTGGCA ATGTTGGGAG AAATACCTGT ATTGCCAGTG
GATGATTTAC CACCGGAAGC TCTACCTGTC ATCCTTTCTC TAGATTCTTT GTATTTAGAG
TTTTATGAGA AATTACGCAG TAACTTAAGG CGCATTGGTG GTAGAAACTT AAAAGTGGTT
TTGGTAACTA GTACTAGTAG CCAGGAAGGT AAAACAACCA GCGCTTATAA CCTGGGTATA
GCCTCCGCCC GCGCTGGCAA AAGAACCTTG ATCATTGAGA CAGATTTGCG ATCGCCTTCA
CGTTCCACCT CTTTGAGAGT TTCTCCCGAC GAGGATGCCA CACTTGAACC CCTGCGTTAT
TATGGCAGCT TAAGTGAATG TATTCGCTTA GTCCCTGAAG TCGAAAATTT ATACATCATC
CCTAGCCCTG GGCCTGTGCG TCAATCTGCC GCTATTTTAG AATCTAGCGA AATGCGGCGG
CTCATGGAAG ATGTGAGAGA ACGCTATGAT TTAGTTATTT TAGATACTAG TCCGTTAAGT
GTATCTAATG ACCCCTTGTT AATTCAACCC TATAGTGATG GCATAGTACT CGTATCACGA
GTTAACTATA CACAAGACAG TATGATGGCT GAAGCCATTG ATCAACTAGT GGAAGCAGAA
CTAGGACTAG TGGGAGTTAT TATCAACGGG GCTGATATCA CCGTTTCTTT ACCACCCTCG
GCTGAATTCT CCGATTCAGT ACCTGGGGAA GAACGGACAA GAGATGAAGA GTCAGAAATT
TCTGTTGGTG TAAGCAATAA CTAG
 
Protein sequence
MTPPIVKSYI IAFEKYKWIG LASFALVVAG STVVAIQPEP APTYTASGTL TYSRPPVSFS 
TTGSEIQQQG QELNQEVLLS DQVIDNVSAK VKIPSRRIAS SVAITPPRRN SRTGELESNV
ISLAYKDSDP KRAQEILLEL MQAMVKLSSD INTGRLKAII GKINERIPQA RSELQAAEKR
LEQYDRRERP AILAAENGSL LGAVTGSQNQ QRVIRQTITG IEAQIFSLQD KLGLDVGQSY
VSSALSADPI IANLRSQIYQ TESQIALLRK DLRPEHPNMI QLQRQKQASE ELLQQRAAEV
IGGGGTAAPL ATNISGIRAQ SSLDPARQQL ANQMVALQTQ KETLQQQLAQ QIRDEIRLRQ
EYSQIPNKQL ERSRLEQAVG LKKAVYDQMQ AKLTDAQTAE AETVSSFSIA QPPVVAADAK
KPKSVPLTLG VGGFLGLIVG GGVIFLLGSL EGTLRTREAI RDSLKQRDVA MLGEIPVLPV
DDLPPEALPV ILSLDSLYLE FYEKLRSNLR RIGGRNLKVV LVTSTSSQEG KTTSAYNLGI
ASARAGKRTL IIETDLRSPS RSTSLRVSPD EDATLEPLRY YGSLSECIRL VPEVENLYII
PSPGPVRQSA AILESSEMRR LMEDVRERYD LVILDTSPLS VSNDPLLIQP YSDGIVLVSR
VNYTQDSMMA EAIDQLVEAE LGLVGVIING ADITVSLPPS AEFSDSVPGE ERTRDEESEI
SVGVSNN