Gene NATL1_18661 details


Gene Information

Locus tag: NATL1_18661
Symbol: fumC
ID: 4780270
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1523566
End bp: 1524960
Gene Length: 1395 bp
Protein Length: 464 aa
Translation table: 11
GC content: 36%
IMG OID: 640085155
Product: fumarate hydratase
Protein accession: YP_001015686
Protein GI: 124026571
COG category: [C] Energy production and conversion
COG ID: [COG0114] Fumarase
TIGRFAM ID: [TIGR00979] fumarate hydratase, class II
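
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1523566
END_BP = 1524960


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1395
print(protein_length(length_bp))  # 464
```

Both results agree with the Gene Length and Protein Length fields reported in the record.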


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.697239
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCAACA TGAAACGACT TGAGAAAGAT AGCCTAGGTT CAATTGATGT TCCAAAGGAT 
GCGCTATGGG GAGCTCAAAC TCAACGCTCA ATATTGAATT TTGCTATCGG GAATGAGGTC
ATACCACTCG AGATTATTCT TGCAATTGCC CAAATAAAAG CCTCTGCTGC TCATGTAAAC
AATCATCTAG GTTTAATAAC TACAGAAACT GCTCAATTTA TTACCGAAGC AAGTTTAGAA
ATTATTGAAG GAAAACATAA TGATCAATTC CCAGTGAGAG TTTGGCAAAC AGGCAGTGGT
ACTCAAACCA ATATGAACGT CAATGAGGTG ATCAGCAATA TAGCTTCAAA ACGTTCAAAT
AATGCGTTAG GTAGTCATAA TCCAATACAT CCCAATGATC ATGTCAATTG CTCTCAATCA
ACAAACGATG TTTTTCCAGC TGCAATCCAA ATAGCCACAA TAGTAACCTT AAAAGATACA
TTAATACCTG AATTAACCAA ACTCATTGAT GTATTCCATC AAAAAAGTAA AGAGTGGAAA
GATATTATAA AAATAGGACG TACCCATCTT CAAGATGCAG TACCTCTAAC TCTTGGACAA
GAAGTCTCTG CTTGGGCTGC TCAATTAGAG ACTGCTTTAA AACGCATTGA AATCAACATC
GAGGAACTTT ATCCACTTCC ACTTGGAGGA ACCGCTATTG GTACAGGGAT TAATGCTCCT
AAAGATTTCG ATAATTTAAT CGCTCTTGAA ATAGCCAAAA AAACAAATTT ACCTTTTGAA
ACTGCAGCTA ATAAATTTGC CATCATGGCT AGTCATGATG GTCTTGTAAA TATAATGTCT
CAAATAAAAT TACTAGCCGT CACTTTTCTC AAAATTGTTA ATGATCTTCG ACTTTTATCT
TGTGGACCAA GAGCTGGATT ATCAGAACTA CAGCTTCCAG CCAATGAACC TGGCAGCTCA
ATAATGCCAG GAAAAATTAA TCCTACTCAA TGCGAAGCCA TGGCAATGGT ATGTACGCAA
ATAATTGGCA TGGATACCTC CGTATCGATA GCAGGAAGTG GAGGACATCT ACAAATGAAT
GTCTATAAAC CACTGATTGG CTACAACATT ATAACAAGTA TCAACCTTAT TCAAAATGCC
TGTAAGAGTT GCAGAGAGAA TATGATTGAA AGTATTCAAC CTAATCAAGC AAAAATCAAA
CAATTTCTAG ACAATTCTTT GATGTTGGTA ACCGCCTTAT CGCCATCAAT AGGATACGAA
AAAGCAAGTA AAATTGCTCA ACTAGCCCAT GAAAAGAATC TAAGTCTAAG ACAAGCTTCC
AGTCAATTAA ATTATCTTGA TCAAGAAGAA TTTGATAAAC TCATGTGTCC AAAATCAATG
ATTGGAAGTG ATTGA
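
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs to be cleaned before any computation. The sketch below shows one way to strip the layout whitespace and compute a GC percentage comparable to the GC content field of the record. The helper names are hypothetical, and only the first displayed line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence, as sample input only.
first_line = "ATGTCCAACA TGAAACGACT TGAGAAAGAT AGCCTAGGTT CAATTGATGT TCCAAAGGAT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full 1395 bp sequence through the same two functions gives a value that can be compared with the 36% reported in the record.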
 
Protein sequence
MSNMKRLEKD SLGSIDVPKD ALWGAQTQRS ILNFAIGNEV IPLEIILAIA QIKASAAHVN 
NHLGLITTET AQFITEASLE IIEGKHNDQF PVRVWQTGSG TQTNMNVNEV ISNIASKRSN
NALGSHNPIH PNDHVNCSQS TNDVFPAAIQ IATIVTLKDT LIPELTKLID VFHQKSKEWK
DIIKIGRTHL QDAVPLTLGQ EVSAWAAQLE TALKRIEINI EELYPLPLGG TAIGTGINAP
KDFDNLIALE IAKKTNLPFE TAANKFAIMA SHDGLVNIMS QIKLLAVTFL KIVNDLRLLS
CGPRAGLSEL QLPANEPGSS IMPGKINPTQ CEAMAMVCTQ IIGMDTSVSI AGSGGHLQMN
VYKPLIGYNI ITSINLIQNA CKSCRENMIE SIQPNQAKIK QFLDNSLMLV TALSPSIGYE
KASKIAQLAH EKNLSLRQAS SQLNYLDQEE FDKLMCPKSM IGSD
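
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal translator, not the pipeline used to produce this record. It builds the codon table from the conventional TCAG ordering; translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as start codons, which does not matter here because the gene starts with ATG. The function name `translate` is illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop layout whitespace
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence shown above.
print(translate("ATGTCCAACA TGAAACGACT TGAGAAAGAT"))  # MSNMKRLEKD
print(translate("ATTGGAAGTG ATTGA"))                  # IGSD
```

The two outputs match the first and last blocks of the protein sequence in the record.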