Gene Ava_4825 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4825 
Symbol 
ID3679401 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp6062360 
End bp6065389 
Gene Length3030 bp 
Protein Length1009 aa 
Translation table11 
GC content39% 
IMG OID637720182 
Producttetratricopeptide TPR_3 
Protein accessionYP_325317 
Protein GI75911021 
COG category[S] Function unknown 
COG ID[COG4995] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.137513 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACTTC CTATTATTCC TCTAACTCTA ATTTTAGCTT TAGCTTCCCC ATCTCTAGCA 
CAAGCACCAA CCCCAACCGC CGAAGAACAG ATAACACAGG CTGTGATATT GAATAGTAAC
GGTGAATCCC TAATTTACAA AGATTTTTTT GGTGTAGGGG AGTTACAAGC AGCTTTGGAA
AACTTCCAGC AGGCGTTAGC TATTTTTAAA AAATATGGTG CTAAGGCTGG AGAAGCGAAC
AGCCTTGTAA ATATTGGTTA TGTGTATTTT CGTAAAGGGG AGTATGGGAA AGCGCTCGAA
TATTTTCAGT CCTCCTTAGA TATTCGCAGA AAAACAAGAG ACCGCCAAAA TGAATGGATA
CCTCTTTCTT ATATTGGTGA AGTATATGTC AATTTGGGAC AGTATCCCCA GGCGCTGGAA
TATTATCAGC CAGCTTTAGC TATTATCAAA GAACTGAAAG CAGCTAATCC CAAAGATTCT
AGTTATGCTA CTAGCGAAAA AACTCTGCTG GCTGATATTG GTGCGGTTTA TTTTCGGATG
GGACAGTATA CAAAAGCTCT CGATTTTTAT CAGAAAACAT TGGCAATGCA AAAAGCTGAT
GATGATAAAA TTGGTGGTAT TCAAACTTTG AATAATATCG GTGTAGTTTA CGTTAATTTA
GGCAACTATA AACAAGCCTT AGATTCCTAT CAGCAAGGTT TAGCTAATCT GCAAGAATGC
TGCTCGAATT ATATTGGTAC GAAAGCCGCA ATTATTAATA ACCTTGCCAG TACAAATTTT
AGTTTGGGTC AATATAAAAA ATCTCTAGAA TTAGCCGAAG AATCAGCAAA TATTTATAGC
AGAATTAATC ATGATGCAGA AAAAGCTACG AAACAAGAGA TAAAATTACT TTATGATTAT
TTAGGGCAAA ACTCCCAAGC TTTGCAACAA GTCGCCAGTC GTGCTAATGT TGGTGATGCT
TTTGGTAAGG ACTCTTTTCA GTTCCAAGGT AGAGCCTTAA ATATGAATAA TATTGGACAA
ATTTATTTGA GTTTGGGTAA ATATGACCAA GCATTAAAAT TGTATCAACA AGCTTTAAAT
ATATATCAAG AGAATAGCTA TAAACCGGGA ATTGCTGTAA CTCTGAATAA TATTGCTAAG
GTGCAAAGTA GTTTAGGTAA GTACCTGCAA GCTATTGAGT TAAATCAGCA AGCTTTAACT
ATTTATCAAG AAGTAGGCGA TCGCACCGGG GAAGGTGTGA CAATGAGTAA TCTAGGACAA
ATCTACCAAA AACAAGGTCA GCAGGAGAAA GCTTCAGGAC TGTATCAGCA AGCTTTAGCC
ATGCACAGAC AAGTTAGCGA TAAAGTCAGT GAAGCCGCAA CCCTCAAACT TTTAGCCGAT
ACCCTATCTG CACAAAATCA ACCACAACTA GCGATCGCAT TTTACAAGCA ATCAGTCAAC
CTCACGGAAA GTATTCGCCA AAGTTTACGC ACCATCCCCG CAGATATCCA AAAATCCTAC
ACAGAAACCG TCGCTGAAAG GTATCGCCGC CTGGCTGATT TATTACTCAA ACAAAACCGT
CCCAGCGAAG CACAGCAAGT TCTAGATTTA CTCAAAATCC AAGAAGCCAA TGATTTTATT
GGTAATCGCC GTAGTCAACC CCAAACAACA ACAGCAGTAG TTAACACTGG ACAAAGGGGA
GTGAATACAG AACCCCAGCT AAGCCAAAAA TTACCACTGC AACCCCAAGA ACAGCAGATA
TCCCAAAAGT ATAGCGCCAT TCAAGACCAA GCGATCGCCC TTGGGCAAGA ACTAACAAAC
CTCCGCCAAA CTCCAGCAAA TGCACGCACA GCCACCCAAG AAAAACGCAT TGCTGAATTA
GTGAAACTTG AGCAAACAAT TACGGCTGAG TTTAATAAAT TTACCAAAAC ACCTGCGGTA
GTCGCTCTTG TACAGCAATT ATCTGCCAAT TCTGGACAGG AAAACCTCAG CCTAAGACAA
CTTAATTCCC TGCGGGATAA TTTGCGACAG TTAAACAAAA AAGCCGTCTT ATTATATCCC
TTAGTATTAG ATGACAGATT AGAGTTAGTT GTCGTTACGG CGGATACACC CCCAATTCAT
CGTCCAGTTC CCGTCAAACC AGCAGAACTA AATCAAGTAA TTAATGAATT TCGACAAGCT
ATAGTTGTTC CCTATAAAGA TAGTAAAATA CCAGCAAATA AATTATATAA CTGGCTCATT
AAACCTATAG AAAATGACCT GAAACAAGCT AATGCTCAAG CAATTATTTA CGCTCCCGAT
AGCAAACTCA GATATATACC ATTAGCTGCA TTATATGATG GCAAAAATTG GCTAATCGAG
CATTATATCA TTAATAATAT TACTGCCGCT AGCTTGACCA AATTAAACAG CAAACCCCAA
GCCTCTCTAC CAACTTTAGC CGCAGCCTTT ACCAAAGGCG ACTATAAAGT AGCAGTAGGC
GAACGTCAAG AAGTATTTAG TGGTTTACAA TTTGCTAAAG TTGAAGTAGA CAATTTAGCC
AAGACAATTA AAGGCACAAA AATACTTTTA GATAATGATT TTAGCCCCCA AGTTACAATC
CCCCAAATGA ACGATTACAA AATCGTCCAT CTGGCAACTC ATGGGATGCT GGTGAGTGGT
GATCCAGAAA GTTCATTTAT ATTATTTGGT AACGGCGATC GCGTCACCAT TAAAGATATC
GAAAACTGGT CTTTACCAAA TGTTGATTTA GTAGTATTAA GTGCTTGTCA AACAGGTTTA
GGTAATCAAT TAGGTAACGG TCAAGAAATT CTCGGTCTAG GATACCAAAT TCAATTAACA
GGTGCAAAAG CTTCCATCGC CTCACTATGG GCTGTTTCTG ACGGCGGCAC ACAAGCATTA
ATGGATGGCT TTTATAATGT CTTAAAAACA GGTAATTTAA CTAAATCTGA AGCTTTACGT
ACAGCACAAC TTTCCTTATT GACAGGCAAT AATCAGTTTA ATCATCCCTA TTATTGGGCA
TCATTTATAT TAATTGGCAA TGGGCTTTAA
 
Protein sequence
MRLPIIPLTL ILALASPSLA QAPTPTAEEQ ITQAVILNSN GESLIYKDFF GVGELQAALE 
NFQQALAIFK KYGAKAGEAN SLVNIGYVYF RKGEYGKALE YFQSSLDIRR KTRDRQNEWI
PLSYIGEVYV NLGQYPQALE YYQPALAIIK ELKAANPKDS SYATSEKTLL ADIGAVYFRM
GQYTKALDFY QKTLAMQKAD DDKIGGIQTL NNIGVVYVNL GNYKQALDSY QQGLANLQEC
CSNYIGTKAA IINNLASTNF SLGQYKKSLE LAEESANIYS RINHDAEKAT KQEIKLLYDY
LGQNSQALQQ VASRANVGDA FGKDSFQFQG RALNMNNIGQ IYLSLGKYDQ ALKLYQQALN
IYQENSYKPG IAVTLNNIAK VQSSLGKYLQ AIELNQQALT IYQEVGDRTG EGVTMSNLGQ
IYQKQGQQEK ASGLYQQALA MHRQVSDKVS EAATLKLLAD TLSAQNQPQL AIAFYKQSVN
LTESIRQSLR TIPADIQKSY TETVAERYRR LADLLLKQNR PSEAQQVLDL LKIQEANDFI
GNRRSQPQTT TAVVNTGQRG VNTEPQLSQK LPLQPQEQQI SQKYSAIQDQ AIALGQELTN
LRQTPANART ATQEKRIAEL VKLEQTITAE FNKFTKTPAV VALVQQLSAN SGQENLSLRQ
LNSLRDNLRQ LNKKAVLLYP LVLDDRLELV VVTADTPPIH RPVPVKPAEL NQVINEFRQA
IVVPYKDSKI PANKLYNWLI KPIENDLKQA NAQAIIYAPD SKLRYIPLAA LYDGKNWLIE
HYIINNITAA SLTKLNSKPQ ASLPTLAAAF TKGDYKVAVG ERQEVFSGLQ FAKVEVDNLA
KTIKGTKILL DNDFSPQVTI PQMNDYKIVH LATHGMLVSG DPESSFILFG NGDRVTIKDI
ENWSLPNVDL VVLSACQTGL GNQLGNGQEI LGLGYQIQLT GAKASIASLW AVSDGGTQAL
MDGFYNVLKT GNLTKSEALR TAQLSLLTGN NQFNHPYYWA SFILIGNGL