Gene Ava_3693 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3693 
Symbol 
ID3679112 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp4601220 
End bp4604210 
Gene Length2991 bp 
Protein Length996 aa 
Translation table11 
GC content44% 
IMG OID637719044 
Producthypothetical protein 
Protein accessionYP_324194 
Protein GI75909898 
COG category[S] Function unknown 
COG ID[COG1615] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTTGGA AATGGTGCTT TCGACTCTCA ATAGTCTTTG TAGGACTTTG GCTACTCTTG 
GATTTGAGTT CCCGCTTGGG GGCAGAGATT TTTTGGTTTC AAGAGGTTGG CTATCTGCAA
GTATTTCTCC TGCGGCTGGC GAGTCGTGGG GCTTTATGGG TGGTTGCTGT GGGTGTAACT
GCTGTCTATC TGTGGGGAAA TTTAACTTTG GCGCAACGGC TAAAGTATCC CCGGTCTTTG
AAGATTGCGG AGGTTAGGCG AGAAGAAGCA GAGTTGAGTG TGGGTCTGAA AAACTTTCTC
AGTCCTCAAT ATTCTCTGTT AAATGCGCCT AAAATTCATG ATGCTGGCCA CTTAAAACCT
TTCAGATTGC GTTGGCTGCT ACCCTTGACT GTGGTCTTCA GCTTATTGGC AGGGTTAATT
TTAGTTCACT ATGGAAAAAT AGCTCTTGCT TACTGGTATC CAGCTTTTAA CAAGAATAGT
TTACCGATAA TTACTCCATT TCGTTTAGAA ACTATCTGGG AACTGGGCAG GCAGGTTTTT
TCCCAAGTTT TATATCTGGG TCTCATTGTC GGCGTAGCGA TCGCTATTCT TATTTACTCA
CAATTTTTCC TCAGGGCGAT CGCTGTTATT CTCAGTGTTG TGTTTGGGAC AATTCTTTTT
CACAACTGGG CTAAGGTTTT ACAGTATTTC TCCCCTACAC CCTTCAACAG CACTGACCCT
TTATTTGGGA AAGATATCAG CTTTTATATA TTTTCCCTGC CATTGTGGGA ATTGCTAGAA
CTGTGGTTAA TGGGGATGTT TTTGTATGGC TTTATTGCTG TAACTCTTAC CTATCTCCTC
TCAGCTGACA GTCTCAGTCA AGGAATTTTC CCTGGTTTTT CACCCCAACA GCAACGCCAT
CTCTACGGTA TGGCTGGCTT ATTAATGTTG ATGGTGGCTT TCAGTTATTG GCTGAGTCGT
TATGAGTTGG TTTATTCGCC TCGCGGGGTG AGTTATGGCG CTAGTTACAC GGATGTGGTC
GTACAGTTAC CCATTTATAA CATCTTGTGT GTTCTGGGAT TAGCGATCGC ATTTTACCTG
TTGTGGCGGA CAATTTTCTG GCGCGCTAAA TCTCAGTATC GCCAATTTGT CTTTTACGGA
TTGGGTGTTT ATTTGTTTGT GGTCGTAGCG GCTGGGTCTG TTTTACCTAC AGTAGTCCAG
TATTTGATTG TTCAACCTAA CGAATTACAA CGGGAACAAC CATACATTCA ACGTACAATT
GCCTTGACTA GGCAAGCATT TAGTTTAGAA ACAATTGATG CTAGAACTTT TAACCCCCAA
GGAAATTTAA CTACAGCCGC TATCCAAGCT AATGATTTGA CGATTCGTAA CATTCGTCTG
TGGGATAAGC GACCACTGTT AGAAACTAAC CGCCAACTGC AACAATTCCG CCCTTACTAT
CGCTTCCCTG ACGCAGATAT CGACCGCTAC ACCTTAGAAG CGGAAGCAGC CGCAAATAGA
CCAGTAAACC CTAACCAGTT GCCAGCACCA ACAGAACGAC GACAGGTATT AATTGCACCC
AGGGAACTAG ATTACAGTGC AGTCCCAGAG CAGGCGCAAA CATGGATCAA CCAGCATTTA
ATTTATACTC ACGGTTACGG GTTTACCATG AGTCCGGTCA ATACGGCTGG GCCTGGTGGA
CTACCAGAAT ATTTTGTCAA AGATATTGCT GGAAGTAACG AAGGCGCACT TTCTACTTCC
AGTGAAGCAG TTCGTGACAG CATTCCTATT GGGCAACCCC GACTTTATTA CGGTGAAATT
ACCAATACTT ATGTAATGAC TGGTACAAAG GTGAGGGAGT TAGACTATCC CAGTGGTAGT
GATAATGCGT ACAATGCTTA TGATGGTTTG GGTGGTGTCA TTATAGGCAA TGGTTGGCGA
CGGGGACTAT TTGCCATGTA TTTAAAAGAT TGGCAAATGT TGTTTACGCA GGACTTTTTA
CCAGAGACAA AGGTATTATT TCGCCGGGAT GTCAAGCAGA GAATTCAGGC GATCGCACCT
TTTTTAAAAT TTGATAGTGA CCCCTATTTA GTTGCGGCTG ATGGTAGTCC AGCATTTCCA
GGGCAGAATA ATTACTTGTA TTGGATTGTC GATGCTTACA CGACGAGCGA TCGCTATCCC
TACTCAGACC CCGATAATAA TGGCATAAAT TACATTCGTA ACTCTGTCAA AGTAGTTATT
GATGCTTACA ACGGCAGTGT AAAATTTTAC ATTGCAGATG CGACAGATCC CATCATTGCT
ACTTGGTCAG CTATATTTCC CCAGATGTTT CAGCCATTGA GTGATATGCC AGTTACTCTC
CGCAGCCATA TCCGCTATCC ATTAGATTAC TTTGGCATCC AATCAGAGCG GTTAATGACC
TATCACATGA CTGACACCCA AGTATTTTAC AACCGAGAAG ACCAATGGCA AATCCCCAAT
GAAATTTATG GCAGTGAAAG CCGTCCAGTA GAACCTTATT ATTTGATTAC TAGTTTACCT
ACCGTCCCCT TTGAAGAATT TCTTCTCCTG CTACCTTATA CCCCCAAACA ACGGACTAAC
TTAATTGCTT GGTTAGCCGC GCGATCTGAT GGTGAGAACT ACGGTAAATT GTTACTGTAT
AACTTTCCTA AGGAACGGCT TGTATTCGGG CCAGAGCAAA TAGAAGCACG TATTAACCAA
GACCCAGTAA TTTCCCAGCA AATTTCCTTA TGGAATCGTC AGGGTTCGAG GGCAATTCAG
GGGAATTTGT TAGTAATTCC CATCGAACAA TCTCTGTTAT ATGTGGAGCC AATTTACCTG
GAAGCAACAC AAAATAGCTT ACCAACTCTC GTGCGGGTAG TCGTAGCTTA CGAAAACCGT
ATTGTCATGG CACAGACCTT GGAACAAGCT TTACAGGCTA TCTTTCAGCC AGAAGTCACA
CCAGCACCAG CAATTATTCG TCCTTTCGAG GAAGTTACTC CACCAGGTTA A
 
Protein sequence
MFWKWCFRLS IVFVGLWLLL DLSSRLGAEI FWFQEVGYLQ VFLLRLASRG ALWVVAVGVT 
AVYLWGNLTL AQRLKYPRSL KIAEVRREEA ELSVGLKNFL SPQYSLLNAP KIHDAGHLKP
FRLRWLLPLT VVFSLLAGLI LVHYGKIALA YWYPAFNKNS LPIITPFRLE TIWELGRQVF
SQVLYLGLIV GVAIAILIYS QFFLRAIAVI LSVVFGTILF HNWAKVLQYF SPTPFNSTDP
LFGKDISFYI FSLPLWELLE LWLMGMFLYG FIAVTLTYLL SADSLSQGIF PGFSPQQQRH
LYGMAGLLML MVAFSYWLSR YELVYSPRGV SYGASYTDVV VQLPIYNILC VLGLAIAFYL
LWRTIFWRAK SQYRQFVFYG LGVYLFVVVA AGSVLPTVVQ YLIVQPNELQ REQPYIQRTI
ALTRQAFSLE TIDARTFNPQ GNLTTAAIQA NDLTIRNIRL WDKRPLLETN RQLQQFRPYY
RFPDADIDRY TLEAEAAANR PVNPNQLPAP TERRQVLIAP RELDYSAVPE QAQTWINQHL
IYTHGYGFTM SPVNTAGPGG LPEYFVKDIA GSNEGALSTS SEAVRDSIPI GQPRLYYGEI
TNTYVMTGTK VRELDYPSGS DNAYNAYDGL GGVIIGNGWR RGLFAMYLKD WQMLFTQDFL
PETKVLFRRD VKQRIQAIAP FLKFDSDPYL VAADGSPAFP GQNNYLYWIV DAYTTSDRYP
YSDPDNNGIN YIRNSVKVVI DAYNGSVKFY IADATDPIIA TWSAIFPQMF QPLSDMPVTL
RSHIRYPLDY FGIQSERLMT YHMTDTQVFY NREDQWQIPN EIYGSESRPV EPYYLITSLP
TVPFEEFLLL LPYTPKQRTN LIAWLAARSD GENYGKLLLY NFPKERLVFG PEQIEARINQ
DPVISQQISL WNRQGSRAIQ GNLLVIPIEQ SLLYVEPIYL EATQNSLPTL VRVVVAYENR
IVMAQTLEQA LQAIFQPEVT PAPAIIRPFE EVTPPG