Gene Ava_4850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4850 
Symbol 
ID3679348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp6108360 
End bp6110090 
Gene Length1731 bp 
Protein Length576 aa 
Translation table11 
GC content47% 
IMG OID637720207 
Productpentapeptide repeat-containing protein 
Protein accessionYP_325342 
Protein GI75911046 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00626433 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGATAAAA AAATGTCACA ACTACCTCAT GTTTGGAAAC TATTACAGAA TTACGTTCGG 
GGTACAAGGT CAACAGATAG GCCTACAACA AAGTATGTAG GTAGTCGCTC ATTTTTACCT
CTGGGAAAAT ACCAACAAAA ACCAGCCAAA GATGTAAACT TTTCTCAAGT TAATTTACAG
AAAGAAATTT TATGTGAAAG ATTACAAATC AGTTTAGAGG CATGGATAAG AGAGGATAGT
AGGGGGCAAT TTTCAACTAG TAGTAGCCAA CTCCAACAAG AGATTTATGA TTTACTCGGT
AATGGTGCAT TAACTCCAGA AATAGTCGAA AAAGTTATAG AATTGCCTAT CCTTGAAAGA
GAATTTCGCC CAGAGTTGCT ATTTCACCGC CTAGAAGACT TTTATCAACG TTGGTGTCGA
GGAGAATTTA TCGATGCGCC ACCAAGCGAG AACTTACCTC AGCGAAAATT GCTGGAGATG
GCAGCACAAA ATCAGGGTAT TGGACTGAGG CAGATTGATA TTTACGCCGG GCTTAATGTC
CTGATTTTAC TTCTAGAATT ACACCGCCAC GCCCAAGAGC AAGAAGAACT GCGGCAACAA
ATCAACTTTT ATCCTTCTTC CCCACCAGAT ACAGACAGCT TTTTCACCTC CCAACTACTG
CGGGTAATTA ATTATAGCGA TGCCATTGAA ATTGGCAACT TTAGTAATAT CGTGGGGGAA
TTTCTCCGGG ATGGTAATTT TCAAGGTGCA TACCTGGGAA ATGCCAACTT GACGGGAGTC
AACTTCAGTG GTGCTAACCT CAGTGGTGCA TACCTTGGTG ATGCCAACCT TACAGGCGCA
AACTTCCAAG GTGCTAACCT CACAGGTGCA GACTTTGGTG ATGCCAACCT CAGTAGTGTT
AATCTCAGTG GTGCTAACCT GAGTAGTGCC GACCTCAGCA GCGCCAACCT TACAGGTGCA
AACCTAAGCG GTGCTAACTT GGAACGTGCC GACCTCAGCC GCGCTGACTT GAGTAGCTGC
ATCCTCAATG ATGGCGAATT AAGCCACGCC AACCTCAGTG GTGTCAACTT CAGAGATGCC
GAACTCTGTC GCGCTAACCT CAGCAACGCC ATCTTATTTG GTGCTAACCT CAGTGATGCC
AACCTCAATC ATGTTGACCT CAGTCGCGCC GACCTTTGTC GTGCAGACCT GAGTGGTGCA
GACCTCACCC ACGCCACCCT CAACGGTACT AACCTCAGCG ACACCATTCT TTTTAGTACT
AACTTAAGTG ATGCCATTCT GGAGGCAGCC GACCTCAGTT ATGCCAAACT CAACGGCGCG
AAACTCAACT ACGCCAGACT CAACGGTGCT ATGTTCTTAG GTGCAGACCT CAGTGGCGTA
GATTTAACTG GCGTGGTTCT CAACGATGCC GATTTGAGTG GCGGAATTCT CAGCGAAGCC
GACCTCACAG GTGCAGACCT CAGCGATGCG GTACTTTTGG GTACTGACTT CAGCTTTGCC
AATCTCAACA GCGCCAACCT CAGTGGTAGT AACTTGAGTG GCGCAATTTT GAATGGTGCA
GACCTCAGTA GCGCTAACTT CAGTTATGCC ATTCTCGACG ATACAGACTT AAGTGAAGCC
AACCTAGAAG ATATGACCTG GGGAGAAATT CAGCAATGGG AAGGTGTGCG CGGTTTGGAG
ACAGCGCTGA ATCTGCCGGA GGCGTTGCGG GAAAAGTTGG GAAAGAGGTA A
 
Protein sequence
MDKKMSQLPH VWKLLQNYVR GTRSTDRPTT KYVGSRSFLP LGKYQQKPAK DVNFSQVNLQ 
KEILCERLQI SLEAWIREDS RGQFSTSSSQ LQQEIYDLLG NGALTPEIVE KVIELPILER
EFRPELLFHR LEDFYQRWCR GEFIDAPPSE NLPQRKLLEM AAQNQGIGLR QIDIYAGLNV
LILLLELHRH AQEQEELRQQ INFYPSSPPD TDSFFTSQLL RVINYSDAIE IGNFSNIVGE
FLRDGNFQGA YLGNANLTGV NFSGANLSGA YLGDANLTGA NFQGANLTGA DFGDANLSSV
NLSGANLSSA DLSSANLTGA NLSGANLERA DLSRADLSSC ILNDGELSHA NLSGVNFRDA
ELCRANLSNA ILFGANLSDA NLNHVDLSRA DLCRADLSGA DLTHATLNGT NLSDTILFST
NLSDAILEAA DLSYAKLNGA KLNYARLNGA MFLGADLSGV DLTGVVLNDA DLSGGILSEA
DLTGADLSDA VLLGTDFSFA NLNSANLSGS NLSGAILNGA DLSSANFSYA ILDDTDLSEA
NLEDMTWGEI QQWEGVRGLE TALNLPEALR EKLGKR