Gene A9601_18791 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_18791 
Symbol 
ID4718617 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1614634 
End bp1615926 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content33% 
IMG OID640079613 
Productcystathionine beta-lyase family aluminum resistance protein 
Protein accessionYP_001010269 
Protein GI123969411 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4100] Cystathionine beta-lyase family protein involved in aluminum resistance 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCTTG ACAATAACTT AAAACTGGCT GAACACACTG TTTTTTCTGT AGAAGAGAGT 
TTAAGTAAAG TTTTCCAAGA AAGGTCCAAT CAGGTTTTCC AGAAATTAGA AAATATTTTG
ACAATTTTTA AGGAAGAAAA AGTTTCTACT AGTCATTTCA ATCAATCTTC TGGTAGTGGT
CATGATGATA TATCTAGAGA AAAAATTGAT GCAGTTTTTG CAAGATTATT TCTTGCTGAA
AAGGCTGCTG TGAGGATGCA ATTTGTGAGT GGAACCCATG CAATAAGTTC TGTCTTATTT
GGAATTCTTC GACCCGGAGA TGTAATGTTA TCTCTTACAG GACAACCATA TGACACGTTA
GAAGAAGTCA TAGGAATAAG GGGAGGAGGA AAAGGATCAC TTAAAGATTT TGATATTGAA
TACAAGCAAG TAAATATCTG CGAGAATTTT GATTCTTTTG AAGAAAAAAT TGTTCATTTT
TTTAAAGAAA ATTCATGTAA ATTAGTATTC ATACAAAAAA GTTGTGGATA TAGTTGGAGA
AAGTCTCTTA CGAATCATCA GATAGAGAAA ATTTGTAGTC TTATTCATTC TCTTGATACT
AACTGCATAT GTTTTGTTGA TAACTGTTAT GGTGAGCTTG TTGAAGATAG TGAACCAATT
TCTAAAGGGG CAAATATAAT TGCTGGTTCA TTGATTAAAA ATTTGGGAGG AACAATAGTT
CCTACTGGTG GGTACGTTGC AGGAGATGCA GAGTTGGTTG AGATGGCATG TTCTAGATTA
ACCTCACCAG GCATTGGCTC TTCTGCAGGA ATAAATTTTG GATTAGGAAG ATTAATTTTG
CAGGGTTTGT TTTTAGCACC ACAAATTGTT CATGAATCAC TAAAAGGTGC TGATATGGTT
GCAGCAGTCT TTAAGAATTT GGGATTTAAG GTTTTACCAG AGCCAGCAAC TTATAGATCT
GATCTTATTC AGTCAGTAAG ATTGAATAAT CCTGATTTGG TACAAAAAAT TTGTCAATCT
TTTCAAAATT CTTCACCAGT AGATTCTTTT CTAAATGTTG TTCCATCATC AATGGATGGA
TATGATTCAA AATTATTAAT GGCAGGAGGT ACCTTTATTG AAGGTAGTAC AAGTGAATTT
TCTGCTGATG CCCCTCTAAG AGATCCTTAT AATATTTTTG TTCAAGGTGG TTCTCACATA
GCTCACATCA AAATTGCATT AATTCGATTA TTATCTGAAC TATTAGAGGA AAAATTAATT
TCAAAGGATT CTCTACTTCC TTTATCTACT TAA
 
Protein sequence
MTLDNNLKLA EHTVFSVEES LSKVFQERSN QVFQKLENIL TIFKEEKVST SHFNQSSGSG 
HDDISREKID AVFARLFLAE KAAVRMQFVS GTHAISSVLF GILRPGDVML SLTGQPYDTL
EEVIGIRGGG KGSLKDFDIE YKQVNICENF DSFEEKIVHF FKENSCKLVF IQKSCGYSWR
KSLTNHQIEK ICSLIHSLDT NCICFVDNCY GELVEDSEPI SKGANIIAGS LIKNLGGTIV
PTGGYVAGDA ELVEMACSRL TSPGIGSSAG INFGLGRLIL QGLFLAPQIV HESLKGADMV
AAVFKNLGFK VLPEPATYRS DLIQSVRLNN PDLVQKICQS FQNSSPVDSF LNVVPSSMDG
YDSKLLMAGG TFIEGSTSEF SADAPLRDPY NIFVQGGSHI AHIKIALIRL LSELLEEKLI
SKDSLLPLST