Gene Information |
Locus tag | Ava_3483 |
Symbol | |
ID | 3679795 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 4317292 |
End bp | 4320456 |
Gene Length | 3165 bp |
Protein Length | 1054 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 637718835 |
Product | TPR repeat-containing protein |
Protein accession | YP_323985 |
Protein GI | 75909689 |
COG category | [S] Function unknown |
COG ID | [COG1511] Predicted membrane protein |
TIGRFAM ID | |
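
The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, not part of the source record: the function names `gene_length` and `protein_length` are hypothetical, and it assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(4317292, 4320456)
print(length_bp)                  # compare with the "Gene Length" field
print(protein_length(length_bp))  # compare with the "Protein Length" field
```

With the values from this record, the two results agree with the Gene Length (3165 bp) and Protein Length (1054 aa) fields.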
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.423026 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGCTG GCATGGGTAT TAATTCCAGA AAGCGATCGC TCGTTGATGG CAATCGGGCG TTAAATTTAT TTACAGACCG TCATGAATTA ACTCGTGTCT TTGCTGCGTA TTTACATGAC GAGCCAGCAG AGAAAATATT ATCTTTTTCT GGCGATGGTG GTAACGGTAA ATCTTTACTA TTGAAGTTTT TACGCACCAA GTGTTGTAAG CGGTTTGGTG CTGATGCTTG GCAGAAATTA AAGACGAAAA CGGCGGCGGA AATCGCAGAT TATATAGAGT CTGCGGATAA TGATCAGTGT GACCTAGTAC CAGCGATTTT ACAGGACTTT GGGCTACAAC CGAATGGCGA TGACCAACCC CAAGACCCAT TTTATGGGTT GTTGATGCTT AGGCGATCGC TTTCACGGGC GGCGACAGAA TTAGGATATC GACTGCGGTT TCCTTTGTAT GACTTTGCCT GTGTTTGGTA TCTCAAACAA AAAAATCGCC TCACACGGGA GAAATTAGCA GAACTGTTTC CCTCCGAAGA AATGGACTTG CTGATTGAAA TCGTCAATGC AGTCAGCGAC ACCTCTTGGG GAACTATCGG TAAAGCTGTT TTCGGAATTT TTAATAAGCA TTTGGGAGAA AACCTGCTTT TACATTGGCA GAAGCGAGGA CTTAAGAAAG AGGATATAGA GGAAATTCGG GGGATGGATG CGGAAACCGA ATTGATGAAC GAACTACCGC GCTATCTGGC TCAAGATTTG AGTGCGGCGA TGTCCCAAGA GAAAGCACCA CCAAGAATTG TTTTGTTTTT TGATACTCAT GAAGCTTTTT GGGGTGGGCA ACGCCAACAA ACGGGCATAC TATACTTTCA ACGGGATGAA TGGCTGCGGT ATTTTTTAGC AGAGTTGGAC TTGAAAGCGG GGATAGTCGC CGTCATTGCG GGAAGAGAAA CACCTCGTTG GGCGCAAGCT GATAATTTTC AGATTCCGCA AAAATATATT GATATTCAGT TAGTCAATCA TCTTTCATCG GCTGATGCAG ATGTGTATTT GCAACGTGCT GAAATTGGGG ATCAGGCTTT GCGGCAAAGT GCGATCGCCT ACTCTAGCGT CACAGCCAAT CAAGTACATC CCTTGCTGTT GGGTTTGTCT GCGGATGTGA TATTACAGGC GCAAGAACAC CTCACACCAG AAGATTTTCC CAAGCAAGAG GCGACATTAA ATAAAGCCAA ATACCTCATG AACCTGCTGC TGAAATATAC AGATAGAGAA TTTGGTTATG CTGTTCATGC TTTAAGTGCT TGTCGCTCAT TTAACTTTGA AATTTACCGT CTACTAGCAG AGGAATTGCA TTTTTCTACC ACCAAACCAG CCTTCGACAT CCTCACAGAA TTTTCCTTTG TTTGGGATGT GGAAAAGTTG GGTGAAAATT GGTATCGCAT CCACGACTTG CTACGGCGCT TAAACTATGA AAACAGTAAT GAAATTACTC AACAGGCTCA TGTTGTTTTA GAAAAACACT ACCGCCAACA GGGACAAGTC GCAGAAGCAA TTTATCACGC TAACCGCTTA GACTGGCGGC GGGGTGTGGA TGAATGGGAA GAAGTGTTTG AGCAAGCCTT GGAGTTGAGT CGTTATGCAC AATGTCGTTC ACTGTTGGAA GTTAGGAGTG AGTTGGTAAT TAACAGCGAT TTTCAAATCG GTAGAGTGTC CCAATCTGAA GGTGATTACT TTGCTCAATT AGCTAAATAT CAAGAAGCGC AAACAGAATA TTTAGAGGCT GTAGCTGCAT ACAACCGAGA ATTAAGCATT 
ACCCCCGATG ATACCGCCAC TCTCAACAAC AAAGGGTTAG CGTTAGAAAG TTTAGGGAAT TTGCAAACAC AACTAGCCCA GCATACTCAA GCCATACAAT CCTACACTAG TGCGATCGCT GCCTATGACC AAGCCCTCAA TCTCGCTCCT AACTACACCC AAACTATCAA CAACAAAGGG TTAGTCTTAA AAAATTTAGG GGATTTGCAA ACAAAACTCG CCCAGCATCC CCAAGCCATA CAATCCTACA CTAGTGTGAT CGCTGCCTAC GACCAAGCCC TTAATCTCGC TCCTGACTAT ATCAATGCTC TCAATAACAA AGGGGTAGCG TTACAAAGTT TAGGGAATTT GCAAACAAAA CTCGCCCAGC ATCCCCAAGC CATACAATCC TACACTAGTG CGATCGCTGC CTATGACCAA GCCCTCAATC TCGCTCCTGA CTATATCAAT GCTCTCAATA ACAAAGGGGT AGCGTTACAA AGTTTAGGGA ATTTGCAAAC AAAACTCACC CAGCATACCC AAGCCATACA ATCCTACACT AGTGCGATCA CCACCTACGA CCAAGCTCTT AATCTCGCTC CTGACGACAC TTATGCTCTC AACAACAAAG GGAATGCGTT ACAAAGTTTA GGGAATTTGC AAACAAAACT CGCCCAGCAT CCCCAAGCCA TACAATCCTA CACTAGTGCG ATCGCCACCT ACGATCAAGC TCTCAATCTC GCTCCTGACG ACACTTATGC TCTCAACAAC AAAGGGTCAG TCTTAAAAAA TTTAGGGGAT TTGCAAATAA AACTAACCCA GCATAGTGAA GCCATAGAAT CTTACACTAG TGCGATCGCC GCCTACGACC AAGCCCTCAA TCTTGCTCCT AATTACACCT ATGCTCTCAA CAACAAAGGG TTCGCGTTAC AAAGTTTAGG GAATTTGCAA ACAAAACTCG CCCAGCATAG TGAAGCCATA GAATCTTACA CTAGTGCGAT CGCCGCCTAC GATCAAGCCC TCAATCTTGC TCCTAATTAC ACCTATGCTC TCAACAACAA AGGGAATGCG TTAGCAAAAT TAGGGGATTT GCAAACAAAA CTCGCCCAGC ATACCCAAGC CATACAATCC TACACTAGTG CGATCGCTGC CTATGACCAA GCCCTCAATC TCGCTCCTCG CTACATCAAT GCTCTCAACA ACAAAGGGTT AGCGTTACAA GGTTGGGGTA AATTACTTTT ACAGTTATCC CAAAAACCAG AAGCAGTCAA TCATTTACAA GCAGCATTAG CCGTCTTTAA TAGCTCCTTA GCAATAGTTT CCGGTGATGA GAGTGTTCGC AACTTAAGAG ATGAACTACA AGAATTTCTG GATAATTTAA CGTGA
|
Protein sequence | MSAGMGINSR KRSLVDGNRA LNLFTDRHEL TRVFAAYLHD EPAEKILSFS GDGGNGKSLL LKFLRTKCCK RFGADAWQKL KTKTAAEIAD YIESADNDQC DLVPAILQDF GLQPNGDDQP QDPFYGLLML RRSLSRAATE LGYRLRFPLY DFACVWYLKQ KNRLTREKLA ELFPSEEMDL LIEIVNAVSD TSWGTIGKAV FGIFNKHLGE NLLLHWQKRG LKKEDIEEIR GMDAETELMN ELPRYLAQDL SAAMSQEKAP PRIVLFFDTH EAFWGGQRQQ TGILYFQRDE WLRYFLAELD LKAGIVAVIA GRETPRWAQA DNFQIPQKYI DIQLVNHLSS ADADVYLQRA EIGDQALRQS AIAYSSVTAN QVHPLLLGLS ADVILQAQEH LTPEDFPKQE ATLNKAKYLM NLLLKYTDRE FGYAVHALSA CRSFNFEIYR LLAEELHFST TKPAFDILTE FSFVWDVEKL GENWYRIHDL LRRLNYENSN EITQQAHVVL EKHYRQQGQV AEAIYHANRL DWRRGVDEWE EVFEQALELS RYAQCRSLLE VRSELVINSD FQIGRVSQSE GDYFAQLAKY QEAQTEYLEA VAAYNRELSI TPDDTATLNN KGLALESLGN LQTQLAQHTQ AIQSYTSAIA AYDQALNLAP NYTQTINNKG LVLKNLGDLQ TKLAQHPQAI QSYTSVIAAY DQALNLAPDY INALNNKGVA LQSLGNLQTK LAQHPQAIQS YTSAIAAYDQ ALNLAPDYIN ALNNKGVALQ SLGNLQTKLT QHTQAIQSYT SAITTYDQAL NLAPDDTYAL NNKGNALQSL GNLQTKLAQH PQAIQSYTSA IATYDQALNL APDDTYALNN KGSVLKNLGD LQIKLTQHSE AIESYTSAIA AYDQALNLAP NYTYALNNKG FALQSLGNLQ TKLAQHSEAI ESYTSAIAAY DQALNLAPNY TYALNNKGNA LAKLGDLQTK LAQHTQAIQS YTSAIAAYDQ ALNLAPRYIN ALNNKGLALQ GWGKLLLQLS QKPEAVNHLQ AALAVFNSSL AIVSGDESVR NLRDELQEFL DNLT
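
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It is an illustration under stated assumptions, not the method the database used: `translate` and `gc_fraction` are hypothetical helper names, the amino acid assignments of translation table 11 are taken to match the standard code, and alternative start codons (which table 11 permits) are not handled because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a plus-strand CDS, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence above.
print(translate("ATGTCTGCTG GCATGGGTAT TAATTCCAGA"))
```

Applied to the full gene sequence, `translate` should reproduce the protein sequence listed above, and `gc_fraction` can be compared with the GC content field.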