Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_0542 |
Symbol | |
ID | 3682314 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 683075 |
End bp | 686059 |
Gene Length | 2985 bp |
Protein Length | 994 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 637715870 |
Product | TPR repeat-containing protein |
Protein accession | YP_321061 |
Protein GI | 75906765 |
COG category | [S] Function unknown |
COG ID | [COG4995] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.397005 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGAAC AGCGTTTACA GGCTTATTAT CAGCTAATTC AAACCCTGCT AGATTGCCCT AGTGGGGAAG AACCAGAGAT ATTAGCAGCG AATACAGAAT TGCTAGATGC TGACTTTGTG CAGGTGGTAG GATTAGCAGC AGAGCATTTT GCCCAGCAGG GAGAAGAAAA TAGAGCCAAT TGGTTGAGAA ACTTAGCAAC ATATCTCACC ACCCCAGAAA CGCCCCCCAT CACCGAAGCA GATATAGAGA CTTACTCGCC ATTTATCTTA GAAGTATTGC GAGCAACAGC AGAAAGTAAC GGCAATCCTG AAGTAATTTA CCTGTTGCTG GCAGCTAATA CCGATAAACT AAATCGTATC TTTGCAGAAT TACTGCGCCG TTGGGCAACA AATACCTTAG CAGAAGCAGA ACCAGATACA GCAACATCCA TCGCTAATGT GATTGGTAAC TTTAGTAATC TGATTCAGCA ATTTCCCTTG GGTAGCAAAG CCAACAACAT GGAAATTGCC ATTACTGGCT ACGAAATCAT CCAAACTATT TACACTCGCC CAGCCTATCC TGAAAAATGG GCAACGACGC AAAATAATCT GGCGACTGCT TACTCTGACA GAATATTAGG AAACAGAGGG GAGAATCTGG AGGAGGCGAT CGCTGCTTAT TCTGCGGCTC TGGAAGTTTA TACCCGCACT GATTTTCCTG TAGATTGGGC TGGAACGCAA AATAATCTGG CGATTGCTTA CCGTAACAGA ATATTAGGAA ACAGAGGGGA GAATCTGGAG AAGGCGATCG CTGCTTATTC TGCGGCTCTG GAAGTTTATA CCCGCACTGA TTTTCCTGAA AAATGGGCAA CTATACAAAA TAATCTGGCG ACTGCTTACT TATACAGAAT ATTAGGAAAC AGAGGGGAGA ATCTGGAGAA GGCCATCGCT GTTTATTCTG CGGCTCTGGA AGTTTATACC CGCACTGATT TTCCTGAAAA ATGGGCAATG ACGCAAAATA ATCTGGCGAT TGCTTACTCT GACAGAATAT TAGGAAACAG AGGGGAGAAT CTGGAGCAGG CGATCGCTGC TTATTCTGCG GCTCTGGAAG TTTATACCCG CACTGATTTT CCTCAAAAAT GGGCAATGAC GCAAAATAAT CTGGGGAATG CTTACCGTAA CAGAATATTA GGAAACAGAG GGGAGAATCT GGAGCAGGCG ATCGCTGCTT ATTCTGCGGC TCTGGAAGTT TATACCCGCA CTGATTTTCC TCAAAAATGG GCAATGACGC AAAATAATCT GGGGAATGCT TACCGTAACA GAATATTAGG AAACAGAGGG GAGAATCTGG AGCAGGCGAT CGCTGCTTAT TCTGCGGCTC TGGAAGTTTA TACCCGCACT GATTTTCCTC AAAATCATGC AGAAACTTTG TTTAATCTCG GCATACTATA CCAAGAAGAG AAACAATTTA ACTTAGCTTA CGATACCTTT GCCCAAGCAA TAGCAACCGT CGAAGCTTTA CGGGGTGAAA TTAATGCAGG TGATAATTTA GGTGAAGAAG GCAAGCGCAA ACAAGCAGAA GAATGGAATA AACTTTATAG ACGCATGATA GAAGTTTGCC TAGCATTGGG CAAAGACACT GAAGCGATAG AATATATTGA ACGCAGCAAA ACCCGCTATT TAGTAGAACT ATTAAGCAAA GCAGACTCAA TCAATCAAAA AAATCTCCCT GATATAGACA GCAAAATCCG TTTTGCAGAA ATTCAAAATC TTTTAGACGA TGAAACAGTT ATTATCCAAT GGTATATATT TACTGATTGT TTTCGTGCCT TCATCATCAC TAAAAACCAT CAACCAATCA TTTGGCAATC AGCCTCAGAA AATTTAGATA ATTTAGAAGA ATGGACTGAT AATTATCTGC AAATCTACGG TGAAGATAAA CAAAAATGGC GATATCAGCT AAACGAGCAA CTAACCAAAC TCACTCAAAT CCTTCACCTC AATGAGATAA TTTCTCTCAT CTCCTCCCAA TACAAAAAAC TGATTGTCAT TCCCCATCGT TATTTACATT TATTCCCACT CCATGCTGTA CCCCTAGCTA ATAAACACTC ATCACAACCA GAATATTTAT TTGATAGATT TCCTCATGGC GTAAGTTATG CACCAAGTAA TCAACTTTTA CGATTTACTC AACGGCGATT GCAAAAATTA GCCAACTTAG AATTAAATCC CTTCAGTAAT TTATTTGCCA TTCAAAACCC CACTAACGAT TTAGCCTTCA CCGATATTGA AGTGGAAACC ATAGCCGCAG ATTTTCAACC CCAACAAATC CTCAAACATC ACCAAGCCAC CAAAGCCGCA TTAACAGCAA CACCAACTAA TGAAACCCTC AGCAATTCCC AATGGCTGCA TTTCTCTTGT CACGGTTACT TTAACTTCCG TTCCCCCTTA AAATCTGGTT TACAGTTAGC CGATGCAGTA ACCTCTAACA TTCCCAGCAC CATCAACTCA TCACGCTACC TGAGAATTGA CAACGAAACA GCAATAGATT TAGATAAATG CCTCACCCTA GAAGATATCT TTCAATTAAA CTTAAATAAC TGTCGCCTCG TCTGTCTGTC CGCTTGCGAA ACTGGTTTTA TTGACTATAC AAATAGCAGC GATGAATATA TAGGCTTGGC AAGTGGTTTT ATTCGTGCTG GTGCAACTAA CATGATTAGT AGTTTATGGG CAGTTAGCGA CTTTCACACA GCTTTATTAA TGATTAAATT TTATGAAAAC TTACCACTTT ATCAATATAA TGTGTCCTTA GCCTTGAACC ATACCCAAAC ATGGTTACGC CGAGCAACGC AATCACAAAT TATAGACTGG GTGCAAAGTA AAACCAATAT GCAAAACACA CAACAGCAAA AAATCATTGG CTTTTTACAA CAATATAAAC CTGAACAACA ACCATTTAAA AGACCAGAAT TTTGGGCTGC TTTTTCTGCT ATTTCTCCAG TTTAG
|
Protein sequence | MNEQRLQAYY QLIQTLLDCP SGEEPEILAA NTELLDADFV QVVGLAAEHF AQQGEENRAN WLRNLATYLT TPETPPITEA DIETYSPFIL EVLRATAESN GNPEVIYLLL AANTDKLNRI FAELLRRWAT NTLAEAEPDT ATSIANVIGN FSNLIQQFPL GSKANNMEIA ITGYEIIQTI YTRPAYPEKW ATTQNNLATA YSDRILGNRG ENLEEAIAAY SAALEVYTRT DFPVDWAGTQ NNLAIAYRNR ILGNRGENLE KAIAAYSAAL EVYTRTDFPE KWATIQNNLA TAYLYRILGN RGENLEKAIA VYSAALEVYT RTDFPEKWAM TQNNLAIAYS DRILGNRGEN LEQAIAAYSA ALEVYTRTDF PQKWAMTQNN LGNAYRNRIL GNRGENLEQA IAAYSAALEV YTRTDFPQKW AMTQNNLGNA YRNRILGNRG ENLEQAIAAY SAALEVYTRT DFPQNHAETL FNLGILYQEE KQFNLAYDTF AQAIATVEAL RGEINAGDNL GEEGKRKQAE EWNKLYRRMI EVCLALGKDT EAIEYIERSK TRYLVELLSK ADSINQKNLP DIDSKIRFAE IQNLLDDETV IIQWYIFTDC FRAFIITKNH QPIIWQSASE NLDNLEEWTD NYLQIYGEDK QKWRYQLNEQ LTKLTQILHL NEIISLISSQ YKKLIVIPHR YLHLFPLHAV PLANKHSSQP EYLFDRFPHG VSYAPSNQLL RFTQRRLQKL ANLELNPFSN LFAIQNPTND LAFTDIEVET IAADFQPQQI LKHHQATKAA LTATPTNETL SNSQWLHFSC HGYFNFRSPL KSGLQLADAV TSNIPSTINS SRYLRIDNET AIDLDKCLTL EDIFQLNLNN CRLVCLSACE TGFIDYTNSS DEYIGLASGF IRAGATNMIS SLWAVSDFHT ALLMIKFYEN LPLYQYNVSL ALNHTQTWLR RATQSQIIDW VQSKTNMQNT QQQKIIGFLQ QYKPEQQPFK RPEFWAAFSA ISPV
|
| |