Gene GSU3312 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU3312 
SymbolhemH 
ID2686646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3636130 
End bp3637083 
Gene Length954 bp 
Protein Length317 aa 
Translation table11 
GC content62% 
IMG OID637128006 
Productferrochelatase 
Protein accessionNP_954352 
Protein GI39998401 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0276] Protoheme ferro-lyase (ferrochelatase) 
TIGRFAM ID[TIGR00109] ferrochelatase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00205193 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGACA AAACCGCTGT CCTGCTCCTC CAGATGGGAG GTCCCGATTC CCTCGACGCC 
GTGGAGCCGT TCCTGCTGAA CCTTTTCTCC GACCGGGACA TCATCCGGAT CGGTCCCGCC
TTTCTCCAGC CGTTCATCGC TCGGCTCATC GCCAAGCGCC GCTCGCCGGG GGTGGAGCGC
AAATACGAGG AGATCGGCGG CAAATCACCC ATCCGCGAGC TGACCGAGAG CCAGGCCCGG
GCACTGGAGG ATGTGCTGGG GGACGGGTAT CGCTGCTTCG TGGCCATGCG CTACTGGAAA
CCCTCAACCA TGGAGGCGCT GGCGGCCATC AGGAGGGAGG GGATTTCGCG GGTGATTGCG
CTTTCCCTCT ACCCCCACTA CTCCCGTGCC ACCACCGGCT CGAGCGTGAA TGAACTGAAG
CGGGTGCTGT CTCAATCGGG TGTGCAGTTC CAGATGATGT ACGTGGACCG TTTTTTCGAC
CACCCCCTTT ACATTGATGC CCTGGCAGAG AAAATCCGGG AAGGGCTCGA CGATTTCCAC
CCCCTGGCCG AGGTGCAGGT CCTGTTTTCG GCCCATTCGC TCCCTCAATC CTTCATCGAC
GAGGGTGACC CCTACCTGGA CCATATCCGG GAGACGGTAC GTCTCGTCAT GGAGCGGTTC
GAGGGGGTCA CCCACCATCT GGCCTTCCAG TCCCGGGCAG GGCCGGTCAA GTGGCTGGAA
CCCTCAACCG ACGAGATGCT GGAGCACCTG GCGGCCCACC AGGTGAAAAA CCTGCTCATC
GTTCCGCTTT CCTTTGTCTC GGACCATATC GAAACCCTGC ACGAGATTGA CATCGAGTAT
GCCCAGGAGG CGCATAAGCT GGGCTATTCC CGGTTCCGGC GGAGCCCTTC CCTCAACACG
TCGCCCACTT TTATCTCCTG CCTGGCGGAT CTGGTGCGGC GGGTGGAGGG CTGA
 
Protein sequence
MSDKTAVLLL QMGGPDSLDA VEPFLLNLFS DRDIIRIGPA FLQPFIARLI AKRRSPGVER 
KYEEIGGKSP IRELTESQAR ALEDVLGDGY RCFVAMRYWK PSTMEALAAI RREGISRVIA
LSLYPHYSRA TTGSSVNELK RVLSQSGVQF QMMYVDRFFD HPLYIDALAE KIREGLDDFH
PLAEVQVLFS AHSLPQSFID EGDPYLDHIR ETVRLVMERF EGVTHHLAFQ SRAGPVKWLE
PSTDEMLEHL AAHQVKNLLI VPLSFVSDHI ETLHEIDIEY AQEAHKLGYS RFRRSPSLNT
SPTFISCLAD LVRRVEG