Gene Ava_4725 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4725 
Symbol 
ID3679705 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5910148 
End bp5913261 
Gene Length3114 bp 
Protein Length1037 aa 
Translation table11 
GC content43% 
IMG OID637720081 
Productpolymorphic membrane protein 
Protein accessionYP_325217 
Protein GI75910921 
COG category 
COG ID 
TIGRFAM ID[TIGR01376] Chlamydial polymorphic outer membrane protein repeat 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.708929 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTATTA CTATTTTAAC AGTCAATACT ACAACCGATC AAAATGATGG CAGTGCTGCT 
AATGGTTTAT CTCTGCGAGA TGCGATTTTA ATTGCTAATG CTAATCCGAA TACTGAGTAT
GAAATTCGGT TAACTGGCGG TGTAACTTAC AATCTTACGT CGAACGGGAT TAATGAAGAT
AATGCCCTCA CAGGTGATTT AGATATTAAG AGCCGTAATA ATGTACTTTA TATTGTCTCA
GTTGGTGGTG AGAAAGCGAC TATTGATGCG TCTGGTTTAT TAAATAGCGA TCGCGTTTTT
CATGTCCTCA ATGGTGGTGC TTTAAGTTTA CAGAATGTAG TGGTTACGGG GGGCAAGATA
TCTAATGATG GTGGAGGTAT TAGAGTTGAT TCTAATGGCT ACCTAGATTT GTACAACACT
AACGTCAGTG GCAATAGTGC AGAGGGTGGG AACTGGGGTG GTGGTATTTA TAACAACAAT
GGTACAGTCT ACTTACGAAA TGGTTCTACT ATCAGTAACA ATCAAGCCCT CAACGGTGGC
GGTATCCTTA ACTCTGGCAC TCTGATTACG ATTGACTCCA CCATAAGTAA TAACAACTCA
GGTAGTGGAG GTGGAATCTA CAACTTTGAT ACATTGACGG TAATCAACAC TACAGTTAGT
AATAATAGTG CAAGAGGCAG TGGTGGCGGT ATTCAAGGTA ACGGTAGTTT TTCCTCTATT
GCTTTAGTTA ACACAACAAT TAGTGGAAAC ACAGCAGGAA GCGGTGGGGG TGGTATAGAT
TCTACTGGGG GCAGCGTCAC AAATATACTT AACAGCACGA TTACCAATAA TACAGCAGGC
ATTTTAGGAG GAGGTGGAAT TAGAGGCAGT GCCAATTTAA AGAACACAAT TGTTGCCGGA
AATTTTGGTA ATTATGATTA CCAAGGGACA GGCAAAGATA TTCAAGGAAC AGTTAACGGT
AATAATTATA ACCTGATTGG CTCTCTAGCT GGAGCCAGTG GTACTGTAGG CACAGGTACA
GATATTGTCA ACCCCAACCC CGGACTTGGC CCCTTACAAA ATAACGGTGG ACTCTCTCTC
ACCCACACCC TGCTTGCTGG TAGTCCTGCT ATAAATGCCG GTAACAATAA TTTAATTCCG
GCTGATGCAG AAGATATAGA CCGTGATGGC GACACGACAG AACCCACTCC CTTTGACCAA
AGAGGCTTGG CACGAGTTAG CGGTGGTACA GTAGATATTG GGGCATTTGA GGTACAATCT
GCTACCCTGC CAACCATCAC CATCAACAAT GTTACCGTTA CTGAAGGCAA CACAGGGACA
ACCAATGCCA CTTTTACCGT CACTCTCTCC GCCGCCAGTA CTTCTGCTGT TACGGTTAAC
TATGCTACAG CTAACGGCAC AGCGACAGCA GGTAATGATT ACACCTCAAA CACAGGTATC
TTAACTTTCA ACCCAGGGGA AACCAACAAA ACTATCACTG TTGCGGTGAG AGGCGATACT
ATAGCCGAAC CGAATCAGAC ATTTTTCCTG AACCTCAGCA ATTCCCTTGG CGCTACCATT
GCTGATAACC AAGGATTGGG AACTATCACC GATGATGATG CTAATCCTAC AATTGCTATT
GCTGATGTGA GCCGGAATGA AGGTAATAGC GGTACAAGTA ACACCACATT TACTGTCACT
CTATCGGCTG CCAGTGAAAA GACAATTACA GTTAATTATG GGACAGCTAA TGGTACAGCG
ACAGCCGGAA GTGACTACAC ATCTACAACA GGTACATTAA CTTTCAATCC AGGGGATACG
AGTAAAACCT TCACTGTTGC TGTAACTGGT GATACAACAG TTGAAGCCAA CGAAACCTTC
TTCGTTAATC TCAGCAACGC CACTAACGCC ACCATCACTG ATAATCAGGC GTTAGCAACT
ATACTCAATG ATGACACAGC CACTCTTCCC ACTCTTTCAA TCAACGATAT CACCATTGTT
GAAGGGCAAA GCAGTCAAGC AGTCTTAACT GTTACCCTCA GCAGTGCTTC TAGTCAACCA
GTCAGCGTTA ACTACTCTAC AGTAGCTGGT ACTGCTACAG CTAATACAGA CTATACCAGC
CGCAGTGGTA CTGTAACTTT TGCTGCCAAT ACTACCACGG CGACAATTAC AATTCCGATC
CTGAACGATA ATCTCAATGA AGCTAACGAA ACCTTGAGAG TTAATCTTTC TAGCCCTACC
AATGCAACTA TACAAAAAGC TTCTGGCACT GTCACTATTA CTGACACTTT GCAAGCTAGT
GTAACCACAA CCTTAGCGGA GGGGATTGAG AATTTAACTT TACTTGGCAG TAGCAATATT
AACGCTACTG GTAACAGTAG TAATAATACA TTGACGGGGA ATAGTGGGAA TAATATTCTG
ACTGGTGGTA ATGGGGATGA TACCTACAAT TTCAATGCTT CCACACCATT AGGTAGCGAT
CGCATTCAAG AAATTACCAC AGGCGGCAAT GATACGATCA GTTTTGCTGG TACAAATAAC
GCCGCACGCG TAAATTTAGG TACTATTGCC ACCCAGACCA TCAACAGTAA CTTAAAGTTG
ACCCTATCTG CTAATAATGT GATTGAAAAT GTCGTTGGTG GTAATGGCAG CGATCGCCTC
ATTGGCAATA GCCTCAATAA TACTCTGACT GGTGGCAATG GTAATGATGT GTTAACAGGT
AGAGGCGGGG CAGATACTTT GATTGGCGGG TCTGGCAATG ATATCTTGAC AGGTGGAACT
GAGAGCGATC GCTTTTTATA TAGTAGTGGA CGAGCTTTTA CTAGTAACGA TTTTGGTATA
GATATCCTGA CTGACTTTAC TTCTGGAAGC GATAAGCTGG TTTTGAGTAA AAGGACATTT
ACGGCTTTAA GTAGTGTTAT TGGTGATGGA TTGAGTCAAG TATCTGATTT TACCACTGTC
GAGGATGATG ATTTAGCCGC AACCAGTACA GCATTCCTCG TTTACAGCAT CGGTAGTGGT
AGTCTTTACT ATAACCAAAA TGGCAGTGCT GCTGGGTTTG GTACTGGTGC GGAGTTGGTT
AATTTGATTA ATTTACCTAG TTTAACTGCG GCTGATTTGG CGATCGTTGC TTGA
 
Protein sequence
MAITILTVNT TTDQNDGSAA NGLSLRDAIL IANANPNTEY EIRLTGGVTY NLTSNGINED 
NALTGDLDIK SRNNVLYIVS VGGEKATIDA SGLLNSDRVF HVLNGGALSL QNVVVTGGKI
SNDGGGIRVD SNGYLDLYNT NVSGNSAEGG NWGGGIYNNN GTVYLRNGST ISNNQALNGG
GILNSGTLIT IDSTISNNNS GSGGGIYNFD TLTVINTTVS NNSARGSGGG IQGNGSFSSI
ALVNTTISGN TAGSGGGGID STGGSVTNIL NSTITNNTAG ILGGGGIRGS ANLKNTIVAG
NFGNYDYQGT GKDIQGTVNG NNYNLIGSLA GASGTVGTGT DIVNPNPGLG PLQNNGGLSL
THTLLAGSPA INAGNNNLIP ADAEDIDRDG DTTEPTPFDQ RGLARVSGGT VDIGAFEVQS
ATLPTITINN VTVTEGNTGT TNATFTVTLS AASTSAVTVN YATANGTATA GNDYTSNTGI
LTFNPGETNK TITVAVRGDT IAEPNQTFFL NLSNSLGATI ADNQGLGTIT DDDANPTIAI
ADVSRNEGNS GTSNTTFTVT LSAASEKTIT VNYGTANGTA TAGSDYTSTT GTLTFNPGDT
SKTFTVAVTG DTTVEANETF FVNLSNATNA TITDNQALAT ILNDDTATLP TLSINDITIV
EGQSSQAVLT VTLSSASSQP VSVNYSTVAG TATANTDYTS RSGTVTFAAN TTTATITIPI
LNDNLNEANE TLRVNLSSPT NATIQKASGT VTITDTLQAS VTTTLAEGIE NLTLLGSSNI
NATGNSSNNT LTGNSGNNIL TGGNGDDTYN FNASTPLGSD RIQEITTGGN DTISFAGTNN
AARVNLGTIA TQTINSNLKL TLSANNVIEN VVGGNGSDRL IGNSLNNTLT GGNGNDVLTG
RGGADTLIGG SGNDILTGGT ESDRFLYSSG RAFTSNDFGI DILTDFTSGS DKLVLSKRTF
TALSSVIGDG LSQVSDFTTV EDDDLAATST AFLVYSIGSG SLYYNQNGSA AGFGTGAELV
NLINLPSLTA ADLAIVA