Gene Sfum_1028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_1028 
Symbol 
ID4460930 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp1266865 
End bp1268454 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content61% 
IMG OID639701791 
Producthypothetical protein 
Protein accessionYP_845157 
Protein GI116748470 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1858] Cytochrome c peroxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.743991 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0101542 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAC ATATTCCGGT AATGGCGGTG GTAGTCCTGG CAGCCCTGCT CGCAGTGGGC 
GTGAACGTCA GCCCGGCGGC ACTGTCGGAC ATTGAGAATC TGGGCAAGTT TGTTTTCTTC
CAGAATATAT CCGATCCTCC GCGCATGGCG TGCGCCACCT GCCACGTCCC GAGAGCGGGT
TGGACCTACG GGGTTTCGGG GGTCAACCTG CACCAGGTGG CGGTCACCGG TGCGAATCCC
CACACCAAAG GAGGGCTCAA ACCCCCCATG AGCGCGTATG CCTCATTTTC CCCCCCGTTC
CAGGTCTCTC CCCTCCCTTT CCCCAATTTC CCTTCCGGGT TTCTTGGGGG AGTTTTCTGG
AACGGCCGCT CCGAGGGGGC GGACCCTCAG GTCTTTCCGA ACGGGGCGAC AGTGCCTATC
GGCGATGAGG TCTTTCAGGA CGCCGGAGGC GCCTTTATCC CGGGTTTGAA GACTGCTTAC
GGAAAATACC TGGGACCATT GGCCGAACAG GCGTTCAATC CGTTCTTGAA CCCCGTCGAA
CAGAATCAAA CCCAGCTGGG GGTCTGTCGG ACCGTCGCTT CGGCCGCGTA TGCCCCGCTT
TTCGAGAAAG TCTGGAAGGA ACCGATCACC TGCGAGGAGC GGTTGGCAGT AAACTATAAG
CGAATCGCGG TGTCGCTTGC CGCGTACCAG TCCTCGCCGG AGGTCAACTC ATTCAGCTCC
AGGCGCGATA TCGCGCTCAA AAGGGAGCTC GACGGGATCG ACGTCGACGA CACTCCCGGG
CAGTTCCCGC TCAAGGGCTT GACGGCGCAG GAAAACCTGG GCCACGACCT GTTTTATTCC
ACTCCTGCGA ATCCGTTGAT CGTGAACGGG GTGCCCAAGA TCACCAACTG TTCGCTGTGT
CACGCCAATA ACCCTCCCGT CATCGATCTC TCCACCACTC CTCCCACCTT TACCCCGGGG
GATACGGGAG TGGAGCCGGA ACAGACGTAT TCGGACAATT CATACCACGT GATCGGCGTG
CCCCCGAATC CCGAGATCCC CGGCTTTCCG GTATTCAACG AAGGGCTGAA AGCGCATACG
GGAATTGACG CTCATCTGGC AGCCCAGCGA AGCCCGAGCC TGCGCAACGT GGACAAACGC
CCCGACGCGG ATTTTGTCAA GGCGTACACC CACAACGGAT GGTTCAAGAG CCTGGAATCA
CTCGTTCATT TCTACAACAC GGCCAACGTG AACGGGGTTA CGGCGGCATC CTTCGGCATA
ACCGAATGCG CGGAGGGCAT CGCGACGGAA GTGGATGCGC TGGCGAACAA CTGCTGGCCC
AAGCCCGAGT TTCCCAGTGC GCCCCTGTCC GCCATCAACA TCGGCCTGAT CGGCGACATG
GGGCTGACTC TCGAGGAAGA AGCCGCCATC GTGGCGTACC TGAAAACTTT CACGGACACG
GCCACTCCCA AGAAGCCGCA GCCGTATCTT GAGAGCAAAC CCGGACGCCC GCAAGCCACA
GGGGCGTTGA CGGCCGGCCC CGCTCCTTCG AAGCCGGAAG CGCCGTTGAC GGCCGGTTCC
GCTCCCCGGG AGAAGGCTTC GGGCAGATGA
 
Protein sequence
MKKHIPVMAV VVLAALLAVG VNVSPAALSD IENLGKFVFF QNISDPPRMA CATCHVPRAG 
WTYGVSGVNL HQVAVTGANP HTKGGLKPPM SAYASFSPPF QVSPLPFPNF PSGFLGGVFW
NGRSEGADPQ VFPNGATVPI GDEVFQDAGG AFIPGLKTAY GKYLGPLAEQ AFNPFLNPVE
QNQTQLGVCR TVASAAYAPL FEKVWKEPIT CEERLAVNYK RIAVSLAAYQ SSPEVNSFSS
RRDIALKREL DGIDVDDTPG QFPLKGLTAQ ENLGHDLFYS TPANPLIVNG VPKITNCSLC
HANNPPVIDL STTPPTFTPG DTGVEPEQTY SDNSYHVIGV PPNPEIPGFP VFNEGLKAHT
GIDAHLAAQR SPSLRNVDKR PDADFVKAYT HNGWFKSLES LVHFYNTANV NGVTAASFGI
TECAEGIATE VDALANNCWP KPEFPSAPLS AINIGLIGDM GLTLEEEAAI VAYLKTFTDT
ATPKKPQPYL ESKPGRPQAT GALTAGPAPS KPEAPLTAGS APREKASGR