Gene P9211_15861 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_15861 
SymbolfumC 
ID5731171 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp1413464 
End bp1414864 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content40% 
IMG OID641285964 
Productfumarate lyase 
Protein accessionYP_001551471 
Protein GI159904127 
COG category[C] Energy production and conversion 
COG ID[COG0114] Fumarase 
TIGRFAM ID[TIGR00979] fumarate hydratase, class II 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTAACT CATTTAGAAC AGAGAATGAC AGTCTTGGAT CAATTGAAGT ACCTCGGGAG 
GCATTATGGG GGGCACAAAC CCAAAGGTCT CTATTGAACT TTGCAATTGG ACAAGATCAA
ATTCCTATGA AGCTCATATA CTCCCTGGCA AGAATTAAGC AAGCGGCATC AATTGTTAAT
AATCGACTTG GCACACTGGA TGACAATAAA AGAGATTTAA TCGTTAACGC TGCCTGTGAG
ATAGCTATGG GCAAGCATGA TGCTCATTTC CCATTAAGTG TCTGGCAAAC AGGGAGTGGC
ACACAAACAA ATATGAACTT AAACGAAGTT ATTAGCAACC TCGCCTCAAA AGCTACCGGC
AAACCATTAG GTAGTCATCT GCCATTACAC CCTAATGATG ATGTTAATCA ATCTCAATCA
ACAAATGATG TCTTCCCAGC TGCTATTCAA ATCGCTACTG TACAAGAAAT TCAGACAACT
TTAATCCCTG AACTAGATCT ACTAATTGAG TCATTAACTA AAAAGAGCAA TGCATGGAGT
GAAATTGTAA AAATCGGTCG AACACATTTA CAAGACGCGG TCCCACTTAC CCTTGGGCAA
GAAGTTTCAG CCTGGAGCAC CCAGCTTTCC ACAGCAAGAG AAAGGATAGA AGTCAGTCTC
AGAGAGCTAT ATCCACTTCC ATTAGGAGGT ACAGCCATAG GGACTGGTCT CAACTCACCA
AAGAATTTTG ATAGTGAAAT TGCTACAGAG ATTTCTCTTT TAACTAACTT ACCTTTCAGT
AAGGCTAATA ATAAATTTGC AATTATGGGT AGTCATGATG GATTGGTAAA TGCTATGTCC
CAATTAAAGA TGTTAGCAGT ATCACTATTA AAAATAGTCA ATGATATCCG ACTCTTGTCA
TGTGGTCCAA GAGCTGGGTT TTCCGAACTA CAACTCCCAG AGAATGAGCC TGGAAGTTCA
ATAATGCCTG GGAAAGTCAA CCCAACTCAA TGTGAAGCCA TGTCAATGGT ATGCATGCAA
GTAATAGGTC TTGATACCGC AGTAACTATG GGTGGAGGTG GGGGGCACCT ACAAATGAAT
GCCTACAAAC CATTAATAGG GTTCAATCTT CTAAAAAGCA TTGAACTCTT ACATGATGCG
TGCAGAAGTT GCCGAAAAAA TATGGTAGAG GGCATTAGAC CAAACGAAGA GAAAATTGCA
AAAGATCTTC AGCAATCACT AATGCTTGTC ACAGCGCTGT CTCCTAAGAT TGGGTACGAC
AATGCCACTA AGATTGCCAA ATATGCTTAT GAGACAGGCG TAAGTTTACG AGAAGCAGCT
GTTAATTTTA ATTATGTAGA TGAAGATGAA TTTGATCAAT TAGTTAATCC CATACTGATG
GCCAATCCAA AATCTAAATA G
 
Protein sequence
MTNSFRTEND SLGSIEVPRE ALWGAQTQRS LLNFAIGQDQ IPMKLIYSLA RIKQAASIVN 
NRLGTLDDNK RDLIVNAACE IAMGKHDAHF PLSVWQTGSG TQTNMNLNEV ISNLASKATG
KPLGSHLPLH PNDDVNQSQS TNDVFPAAIQ IATVQEIQTT LIPELDLLIE SLTKKSNAWS
EIVKIGRTHL QDAVPLTLGQ EVSAWSTQLS TARERIEVSL RELYPLPLGG TAIGTGLNSP
KNFDSEIATE ISLLTNLPFS KANNKFAIMG SHDGLVNAMS QLKMLAVSLL KIVNDIRLLS
CGPRAGFSEL QLPENEPGSS IMPGKVNPTQ CEAMSMVCMQ VIGLDTAVTM GGGGGHLQMN
AYKPLIGFNL LKSIELLHDA CRSCRKNMVE GIRPNEEKIA KDLQQSLMLV TALSPKIGYD
NATKIAKYAY ETGVSLREAA VNFNYVDEDE FDQLVNPILM ANPKSK