Gene Ava_4402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4402 
Symbol 
ID3680529 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5516586 
End bp5519129 
Gene Length2544 bp 
Protein Length847 aa 
Translation table11 
GC content44% 
IMG OID637719755 
Producthypothetical protein 
Protein accessionYP_324895 
Protein GI75910599 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00858282 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00244016 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAGCCAG AAAAATTGGG ACGTTTTAAA GAGTACGGCG AACTTATTCT GCAAAAATTA 
GATTTTGTTC CTCAATCTCC CTCCCAACAA GAAGATTGGG TTCCTGCTAG TCTGGATGAT
TGTCTTTTAC GTCTACGGGA AGCTGCTCAA AAAACTGTAG AACTGGCTAC ATCGCCTGTG
AAAATTGGCG TGATGGGAGA ATTTAGTAGT GGTAAAACTC TCCTGTTGGG TAGCCTCATT
GGCTATGCAG ACGCTTTACC CATCAGCGAA AATCCCACTA CTGGTAACGT CACAGCGATT
CATCTCATCC CCCATCCAGG TTTTACTACT ACCCAGGTAG GTAATTTCAC GGTCGAGTAT
CTGACTCGTG AGGGGGTAAA TGAGTGCTTA CGCTTCATGT TAGGAGAAGC TAACCGCCGG
ACAATAGCCG CCGGACTTCC AGCAATGCAA CCAGCAAAAC TCAGTTCTGG GAAAGAAATT
CTCGGTTGGT GTGAAGCCAG TTGGAAGAGT AGTAATAATT TAGAGTTGCG TTATATTTTG
CGGGAGTTGG TGCTGTTTAT CCGCGCCTAT AGTTCCTATG GGGAGGCCTT GTGCGGTGGA
CGTTACGAAA TTGACCCTGA TTCCGCCCGT GAAGGTTTAC AGTTGGCGGA GCAGCCTTTG
GCTATCCAAA CTCTCGGCTT TGAAGATTTA CCACCAGCCC ATATCCGTTT ACCGAGTCCA
CCGCAAAAGT TAGCCACTAA GTTATTACAG AATAGTTTCC CTTTAATTCG CCGTGTGGAT
ATTGATGTGA AAATCTCTAG GGAAATTTGG GATATTACAG ATGCTTCCGA ATTTACTCTT
TTGGATTTTC CGGGGTTGGG TGCGGCTAAC TCTGGTGCTA GGGATACCTT CTTATCATTG
CGGGAATTGG CAGAAGTACA GACAATTTTG GTACTGCTTA ATGGTAAATC GCCGGGGAGC
GATCGCGCCA ATAAAATTTT CACGATGATG CAGCAGCAGC GCCCAGGACA AGACCTGAAG
GATTTGATTT TGGTGGGTGT GGGGCGATTT GACCAGTTAC CGTTGGAAAG TGAAGGGGGG
GAAAGACTAC TCGACCAATT AATTGATGAG AGTCGAACAC CGCATTTAAC AGCCGATAAA
GTTTTACAGC AACTACGAGT TTTACAAACC ACCATCGACG GGGCGAGTGC ATTCACCACC
AACAAAGACC GCATTGCTTT ACTATCGCCA CTGTTGGGAT TGGCGGAACT AGCCAAGCGT
TCCAGCACTA TCAAAGCGGG TTCGCCAGAG TTTTTGGCTA ACTTGGACTA TCCCAATTAC
TTGGAGAGGT CAAAACAGTT GCAGCAAAAG TGGGGATATT TGAGCGATCG CCTGCTAGAA
TCAGATCCAC GCAGTCATTT AAGTCGTAAG TTAGGTTACT TTGCTCAAGA CGGGGGTATC
GCCAAGCTAC GGGAATTGAT GCAGAATCAC GTTGCGACTC ACGGACTTAA GCAACTGTAT
GAGGATACTA GCCGCGCCGC CGATAATTTA CGGCAACAAC AGGATAACCT CAAAGGTATC
ATCGCCGAAA TTCATGAGCA AGGCATCCCC ACAGGTGACA GTCAGGCTTT AATTGATTTG
CGGACTGCAT TGGAAAATTT AGATAAAACT TATCGCAACT TCCAAAAAGA TTTGGGTAAA
GAACCACTCA AAGACCGTCG GGGAACTGTG GTCAGCGATG TGGTGAAAGA TGAACTGACC
TTTAGAGTTT TGAGTTGGAA TCATTGGACT TTGTTATTTA ACAAAGCGAA TAATGGCACA
ATCACCATAA CCGAATCGAA GGGTGCAGCC GGTAAGTTAT TTGACAGAGG AAATAGAACC
AATACCAGTA TCCCTACCAA GAGTGATGAT TTTTATCCCG CTTTTGAAAA GACTGTCAAA
GAAGTAGAAG AATTTGCGCG CGATCGCATC CGCCAAGCAG TGGTAGACTT ACTGAGTAAA
TTATCCCAAC AAATTGCTCC AGAGCGGGAA CGTTTGCAAG CACTCCTCAA CCCAGAAATA
GAACAAGATA TTGAAGCTAA ATTCGGTGGA GAAGAAGCTG ATTTATTTTA CCAATTGTTG
TTAGGTAGTG ACCCGATCCA ATGGCAAGCA GCAATCATAT CAGAAATTAA TCATCAAGAA
AAATTCCTCA CCCCAGAAAT TATGTTTCCT CTGGCGCGTC AGGATGAAAA ACACGATATT
GGTCAAATTT TTGATTGGTC ACCAGAAAAA GCACAAACAA TATCTAAGTC CAGCAATCAT
CAAATGTTTG TGCTACGACT CCGGGATGAA ATCACTGCTA GTGCCAGCTT ACATTTGGTG
CAATATGTTA GTGAAGTAAA TCAAAGAGTC AATGCCGAAT TAGATGGAAT TTTAGATCAA
ATTATTCCCA CCTTACAAAA TATCTCCAAA AAAGATGGCT TGCTAAGGTT TATTGCTGCT
GGTGATACCC AATCTTCTGG TGCAGTTCCA GCTTGGTTAC AAAATCTCTC GGAAATTGCC
GATTTAGCGG TCAAATATCC GTAA
 
Protein sequence
MEPEKLGRFK EYGELILQKL DFVPQSPSQQ EDWVPASLDD CLLRLREAAQ KTVELATSPV 
KIGVMGEFSS GKTLLLGSLI GYADALPISE NPTTGNVTAI HLIPHPGFTT TQVGNFTVEY
LTREGVNECL RFMLGEANRR TIAAGLPAMQ PAKLSSGKEI LGWCEASWKS SNNLELRYIL
RELVLFIRAY SSYGEALCGG RYEIDPDSAR EGLQLAEQPL AIQTLGFEDL PPAHIRLPSP
PQKLATKLLQ NSFPLIRRVD IDVKISREIW DITDASEFTL LDFPGLGAAN SGARDTFLSL
RELAEVQTIL VLLNGKSPGS DRANKIFTMM QQQRPGQDLK DLILVGVGRF DQLPLESEGG
ERLLDQLIDE SRTPHLTADK VLQQLRVLQT TIDGASAFTT NKDRIALLSP LLGLAELAKR
SSTIKAGSPE FLANLDYPNY LERSKQLQQK WGYLSDRLLE SDPRSHLSRK LGYFAQDGGI
AKLRELMQNH VATHGLKQLY EDTSRAADNL RQQQDNLKGI IAEIHEQGIP TGDSQALIDL
RTALENLDKT YRNFQKDLGK EPLKDRRGTV VSDVVKDELT FRVLSWNHWT LLFNKANNGT
ITITESKGAA GKLFDRGNRT NTSIPTKSDD FYPAFEKTVK EVEEFARDRI RQAVVDLLSK
LSQQIAPERE RLQALLNPEI EQDIEAKFGG EEADLFYQLL LGSDPIQWQA AIISEINHQE
KFLTPEIMFP LARQDEKHDI GQIFDWSPEK AQTISKSSNH QMFVLRLRDE ITASASLHLV
QYVSEVNQRV NAELDGILDQ IIPTLQNISK KDGLLRFIAA GDTQSSGAVP AWLQNLSEIA
DLAVKYP