Gene Ava_0542 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_0542 
Symbol 
ID3682314 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp683075 
End bp686059 
Gene Length2985 bp 
Protein Length994 aa 
Translation table11 
GC content41% 
IMG OID637715870 
ProductTPR repeat-containing protein 
Protein accessionYP_321061 
Protein GI75906765 
COG category[S] Function unknown 
COG ID[COG4995] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.397005 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGAAC AGCGTTTACA GGCTTATTAT CAGCTAATTC AAACCCTGCT AGATTGCCCT 
AGTGGGGAAG AACCAGAGAT ATTAGCAGCG AATACAGAAT TGCTAGATGC TGACTTTGTG
CAGGTGGTAG GATTAGCAGC AGAGCATTTT GCCCAGCAGG GAGAAGAAAA TAGAGCCAAT
TGGTTGAGAA ACTTAGCAAC ATATCTCACC ACCCCAGAAA CGCCCCCCAT CACCGAAGCA
GATATAGAGA CTTACTCGCC ATTTATCTTA GAAGTATTGC GAGCAACAGC AGAAAGTAAC
GGCAATCCTG AAGTAATTTA CCTGTTGCTG GCAGCTAATA CCGATAAACT AAATCGTATC
TTTGCAGAAT TACTGCGCCG TTGGGCAACA AATACCTTAG CAGAAGCAGA ACCAGATACA
GCAACATCCA TCGCTAATGT GATTGGTAAC TTTAGTAATC TGATTCAGCA ATTTCCCTTG
GGTAGCAAAG CCAACAACAT GGAAATTGCC ATTACTGGCT ACGAAATCAT CCAAACTATT
TACACTCGCC CAGCCTATCC TGAAAAATGG GCAACGACGC AAAATAATCT GGCGACTGCT
TACTCTGACA GAATATTAGG AAACAGAGGG GAGAATCTGG AGGAGGCGAT CGCTGCTTAT
TCTGCGGCTC TGGAAGTTTA TACCCGCACT GATTTTCCTG TAGATTGGGC TGGAACGCAA
AATAATCTGG CGATTGCTTA CCGTAACAGA ATATTAGGAA ACAGAGGGGA GAATCTGGAG
AAGGCGATCG CTGCTTATTC TGCGGCTCTG GAAGTTTATA CCCGCACTGA TTTTCCTGAA
AAATGGGCAA CTATACAAAA TAATCTGGCG ACTGCTTACT TATACAGAAT ATTAGGAAAC
AGAGGGGAGA ATCTGGAGAA GGCCATCGCT GTTTATTCTG CGGCTCTGGA AGTTTATACC
CGCACTGATT TTCCTGAAAA ATGGGCAATG ACGCAAAATA ATCTGGCGAT TGCTTACTCT
GACAGAATAT TAGGAAACAG AGGGGAGAAT CTGGAGCAGG CGATCGCTGC TTATTCTGCG
GCTCTGGAAG TTTATACCCG CACTGATTTT CCTCAAAAAT GGGCAATGAC GCAAAATAAT
CTGGGGAATG CTTACCGTAA CAGAATATTA GGAAACAGAG GGGAGAATCT GGAGCAGGCG
ATCGCTGCTT ATTCTGCGGC TCTGGAAGTT TATACCCGCA CTGATTTTCC TCAAAAATGG
GCAATGACGC AAAATAATCT GGGGAATGCT TACCGTAACA GAATATTAGG AAACAGAGGG
GAGAATCTGG AGCAGGCGAT CGCTGCTTAT TCTGCGGCTC TGGAAGTTTA TACCCGCACT
GATTTTCCTC AAAATCATGC AGAAACTTTG TTTAATCTCG GCATACTATA CCAAGAAGAG
AAACAATTTA ACTTAGCTTA CGATACCTTT GCCCAAGCAA TAGCAACCGT CGAAGCTTTA
CGGGGTGAAA TTAATGCAGG TGATAATTTA GGTGAAGAAG GCAAGCGCAA ACAAGCAGAA
GAATGGAATA AACTTTATAG ACGCATGATA GAAGTTTGCC TAGCATTGGG CAAAGACACT
GAAGCGATAG AATATATTGA ACGCAGCAAA ACCCGCTATT TAGTAGAACT ATTAAGCAAA
GCAGACTCAA TCAATCAAAA AAATCTCCCT GATATAGACA GCAAAATCCG TTTTGCAGAA
ATTCAAAATC TTTTAGACGA TGAAACAGTT ATTATCCAAT GGTATATATT TACTGATTGT
TTTCGTGCCT TCATCATCAC TAAAAACCAT CAACCAATCA TTTGGCAATC AGCCTCAGAA
AATTTAGATA ATTTAGAAGA ATGGACTGAT AATTATCTGC AAATCTACGG TGAAGATAAA
CAAAAATGGC GATATCAGCT AAACGAGCAA CTAACCAAAC TCACTCAAAT CCTTCACCTC
AATGAGATAA TTTCTCTCAT CTCCTCCCAA TACAAAAAAC TGATTGTCAT TCCCCATCGT
TATTTACATT TATTCCCACT CCATGCTGTA CCCCTAGCTA ATAAACACTC ATCACAACCA
GAATATTTAT TTGATAGATT TCCTCATGGC GTAAGTTATG CACCAAGTAA TCAACTTTTA
CGATTTACTC AACGGCGATT GCAAAAATTA GCCAACTTAG AATTAAATCC CTTCAGTAAT
TTATTTGCCA TTCAAAACCC CACTAACGAT TTAGCCTTCA CCGATATTGA AGTGGAAACC
ATAGCCGCAG ATTTTCAACC CCAACAAATC CTCAAACATC ACCAAGCCAC CAAAGCCGCA
TTAACAGCAA CACCAACTAA TGAAACCCTC AGCAATTCCC AATGGCTGCA TTTCTCTTGT
CACGGTTACT TTAACTTCCG TTCCCCCTTA AAATCTGGTT TACAGTTAGC CGATGCAGTA
ACCTCTAACA TTCCCAGCAC CATCAACTCA TCACGCTACC TGAGAATTGA CAACGAAACA
GCAATAGATT TAGATAAATG CCTCACCCTA GAAGATATCT TTCAATTAAA CTTAAATAAC
TGTCGCCTCG TCTGTCTGTC CGCTTGCGAA ACTGGTTTTA TTGACTATAC AAATAGCAGC
GATGAATATA TAGGCTTGGC AAGTGGTTTT ATTCGTGCTG GTGCAACTAA CATGATTAGT
AGTTTATGGG CAGTTAGCGA CTTTCACACA GCTTTATTAA TGATTAAATT TTATGAAAAC
TTACCACTTT ATCAATATAA TGTGTCCTTA GCCTTGAACC ATACCCAAAC ATGGTTACGC
CGAGCAACGC AATCACAAAT TATAGACTGG GTGCAAAGTA AAACCAATAT GCAAAACACA
CAACAGCAAA AAATCATTGG CTTTTTACAA CAATATAAAC CTGAACAACA ACCATTTAAA
AGACCAGAAT TTTGGGCTGC TTTTTCTGCT ATTTCTCCAG TTTAG
 
Protein sequence
MNEQRLQAYY QLIQTLLDCP SGEEPEILAA NTELLDADFV QVVGLAAEHF AQQGEENRAN 
WLRNLATYLT TPETPPITEA DIETYSPFIL EVLRATAESN GNPEVIYLLL AANTDKLNRI
FAELLRRWAT NTLAEAEPDT ATSIANVIGN FSNLIQQFPL GSKANNMEIA ITGYEIIQTI
YTRPAYPEKW ATTQNNLATA YSDRILGNRG ENLEEAIAAY SAALEVYTRT DFPVDWAGTQ
NNLAIAYRNR ILGNRGENLE KAIAAYSAAL EVYTRTDFPE KWATIQNNLA TAYLYRILGN
RGENLEKAIA VYSAALEVYT RTDFPEKWAM TQNNLAIAYS DRILGNRGEN LEQAIAAYSA
ALEVYTRTDF PQKWAMTQNN LGNAYRNRIL GNRGENLEQA IAAYSAALEV YTRTDFPQKW
AMTQNNLGNA YRNRILGNRG ENLEQAIAAY SAALEVYTRT DFPQNHAETL FNLGILYQEE
KQFNLAYDTF AQAIATVEAL RGEINAGDNL GEEGKRKQAE EWNKLYRRMI EVCLALGKDT
EAIEYIERSK TRYLVELLSK ADSINQKNLP DIDSKIRFAE IQNLLDDETV IIQWYIFTDC
FRAFIITKNH QPIIWQSASE NLDNLEEWTD NYLQIYGEDK QKWRYQLNEQ LTKLTQILHL
NEIISLISSQ YKKLIVIPHR YLHLFPLHAV PLANKHSSQP EYLFDRFPHG VSYAPSNQLL
RFTQRRLQKL ANLELNPFSN LFAIQNPTND LAFTDIEVET IAADFQPQQI LKHHQATKAA
LTATPTNETL SNSQWLHFSC HGYFNFRSPL KSGLQLADAV TSNIPSTINS SRYLRIDNET
AIDLDKCLTL EDIFQLNLNN CRLVCLSACE TGFIDYTNSS DEYIGLASGF IRAGATNMIS
SLWAVSDFHT ALLMIKFYEN LPLYQYNVSL ALNHTQTWLR RATQSQIIDW VQSKTNMQNT
QQQKIIGFLQ QYKPEQQPFK RPEFWAAFSA ISPV