Gene Ava_4867 details


Gene Information

Locus tag: Ava_4867
Symbol:
ID: 3679288
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 6134485
End bp: 6135573
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 33%
IMG OID: 637720225
Product: HEAT repeat-containing PBS lyase
Protein accession: YP_325359
Protein GI: 75911063
COG category: [C] Energy production and conversion
COG ID: [COG1413] FOG: HEAT repeat
TIGRFAM ID:
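
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption that agrees with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is excluded from the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


gene_length, protein_length = cds_lengths(6134485, 6135573)
print(gene_length, protein_length)  # 1089 362
```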


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.116369
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000150992
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGTCACTTC TGACAGAAAC TTTAGAAAGA ATCTTTAAGC ATCTGCAACA ACATAGACCA 
GAGGTAGCCT CACTTCTAAA ACCAGGAAAA AAAATTGAAG AAATTCAATC TCAAATTAAC
AATTTACCTT TTCATTTACC AGAAGAAGTT TATGAACTTT ATCAGTGGCA TAATGGTATA
GATTTTACAA ATATTTCAAG AATTAGTTTA CCTGTTGATA TTAGTTTTAT TCCTGGCTTT
GACTTTCTGC CTTTGGAACA AGCGATAGAA GATTCTCAAG AAATTGAGGA GTTTAGGCAT
GAATATACTT CACCAGAAGA TGAGAACTGT CATAAACCTT GGTTTCCAAT CTTCGGTTCT
GATGATTTAG AATATTTCCT TGTTTTTGGT AATTCAGACA ATAATAAATA TTCATCTATA
ATGCACTGTC ATTTAGGAGG TGGAAGTCTA CCTAAATTGA AATATCCTAA CCTAACAACT
TTCATGCTAA TAGTAGCAGA ATGCTATGAA ACAAACGCTT ATTACCTGAC ACAGCCTTCA
GAAGATAATT ATTTTAGAGT ATTCTTAGAA GAAAATCCAA AACAATTCGC TGAAATTGAA
CGTAAATATT TTCTTGAAGA GTTAGAAGAA TTAATAAAAG CAATATCTCA ACCTAAATGT
TTACTATCAA ACAATATATT TAATAGCTTA TGTCGCTTTA AAGATCCAAG ATTTGTAGAA
CCAATGATTC ACATCCTTCA TTTACCTTTA TCTGAAGTGG ATAATGAAGA AGAAAACATC
TCAATTCGTA TAAGTGCAGC TATAATTCTT GGAGAAATTG GAGATTTGAG AGCAGTAGAA
CCTTTAATGA GAGCATTAGA ATCTCGGTTG AAAGAAGACC GGGGATACTC TGTAGCAAAT
AATGCAGCAG AAGCACTCAG AAAATTAGGA GATCCCAGAG CTATTGAACC GTTGAAACGA
TTTGTGCAAA ATAATCAACA GTATCTTCCA ACTTTCATAC CGTCAACTTC ATGGCTTGAG
CGAATTGCTC TTCAAGAAGC TTTAGAGCAA GCAAATTGGG CAATCAAAGA ACTTGAGAAA
ATAATTTGA
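
A GC content figure like the one listed in the gene information can be recomputed from a sequence laid out as above. The following is a minimal sketch, assuming the sequence is pasted as plain text with spaces and line breaks between 10-base blocks; the helper name `gc_content` is hypothetical.

```python
def gc_content(dna):
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (spaces, newlines) between sequence blocks is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input, not the gene itself: 2 of 4 bases are G or C.
print(gc_content("AT GC"))  # 50.0
```

To apply it to this record, pass the full gene sequence text as the argument.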
 
Protein sequence
MSLLTETLER IFKHLQQHRP EVASLLKPGK KIEEIQSQIN NLPFHLPEEV YELYQWHNGI 
DFTNISRISL PVDISFIPGF DFLPLEQAIE DSQEIEEFRH EYTSPEDENC HKPWFPIFGS
DDLEYFLVFG NSDNNKYSSI MHCHLGGGSL PKLKYPNLTT FMLIVAECYE TNAYYLTQPS
EDNYFRVFLE ENPKQFAEIE RKYFLEELEE LIKAISQPKC LLSNNIFNSL CRFKDPRFVE
PMIHILHLPL SEVDNEEENI SIRISAAIIL GEIGDLRAVE PLMRALESRL KEDRGYSVAN
NAAEALRKLG DPRAIEPLKR FVQNNQQYLP TFIPSTSWLE RIALQEALEQ ANWAIKELEK
II
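
The protein sequence above corresponds to the gene sequence read codon by codon. The sketch below illustrates this on the first and last lines of the gene sequence. It builds the codon table by hand rather than using a bioinformatics library; translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons), so the standard assignments are used. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA to protein, stopping at the first stop codon.

    Whitespace between sequence blocks is ignored; trailing bases that
    do not form a full codon are dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGTCACTTC TGACAGAAAC TTTAGAAAGA ATCTTTAAGC ATCTGCAACA ACATAGACCA"
last_line = "ATAATTTGA"
print(translate(first_line))  # MSLLTETLERIFKHLQQHRP
print(translate(last_line))   # II
```

The two outputs match the first 20 and last 2 residues of the protein sequence listed above.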