Gene Sfum_0074 details


Gene Information

Locus tag: Sfum_0074
Symbol:
ID: 4461164
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Syntrophobacter fumaroxidans MPOB
Kingdom: Bacteria
Replicon accession: NC_008554
Strand:
Start bp: 90604
End bp: 92307
Gene Length: 1704 bp
Protein Length: 567 aa
Translation table: 11
GC content: 57%
IMG OID: 639700826
Product: TPR repeat-containing protein
Protein accession: YP_844212
Protein GI: 116747525
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2956] Predicted N-acetylglucosaminyl transferase
TIGRFAM ID:
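
The length fields in this record follow from the coordinates. Assuming the Start and End positions are 1-based and inclusive (an assumption; the record does not state its convention), the gene length is End - Start + 1, and the protein length is one less than the codon count because the last codon is a stop. A minimal sketch of that arithmetic, with hypothetical function names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(90604, 92307)
print(length, protein_length(length))  # prints: 1704 567
```

Both values agree with the Gene Length and Protein Length fields of this record.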


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00635622
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATTCTGC GATACCTTCT CATCATTGCA GTGCTTTCCG TGTGCAGTTG CACCCAGCTT 
AAACCGTACG CGGCGACATC TTCCACCCCC AAAGCCCCGG CAGCCCGCTC CGAGTCGGGC
TCACGCGCCG ACTTGTACTT CGAGTACCTG CTCGCCCAAT ATTATGTCAG CGTGAAAGAA
ATCGACAAGG CCATTGAAGC ATACCACGAA GCCCTGAAAA AGGATCCCCG ATCGCCGATG
CTGCTCACCG AGTTGGCCGC TCTCTTGATC CGGCAGGGCA AGATCGAACA GGCCCTCAAG
CTCACCGAAG ACGCCACGAG CTTTGACCGC ACCTACGAGC CCGCCTACAT GCTCCTCGGA
CAGCTGTATG CCGGAATCGG GCAGAATGCC AGGGCCATCG ATGCATATAG CCGGGCCATC
GAGATCAATC CTTCCAACGA GGATGCGCAC CTCCTTCTGG GCGCGCTCTA CGCCCAGGAG
AAGAAATACG ATGAGGCGAT GGAGGCATTC GACCACCTCA AGGCCCTGCT CCCGGACAAC
CCTGTGGCAC TGTACTACAA GGCACGGGTT TTCCTGGACA TGAAACTTTA CAAACAGGCC
GAAAAGATCT ACCTCGACGT TCTTGCGATC GAACCCGCAT TTGAAAACGC CTCTCTCGAC
CTGGCATACG TTTACGAGGT GACCGAACGG CTGAAGGACG CCGAGCAGAC CTATCTGCAG
ATCCTGTCGG CGAATCCGGC CAATGTAAAC GCCCGCACGC GGCTGGGCAA CCTTTACATG
CGCCAGGACA GGCCCGCGGA GGCCTTACGG CATTTCTCCC ATCTTCTGAA GCTCAACCGC
AAGGATGTCG AGAGCCGTTT GAAAGTGGGC ATCATCCACC TTCAACAGAA GGATTACGAG
GAGGCCATCA AGGATTTCAC CTACCTGTTG AAGGACGAGC CGCAGTACGA CCAGGCTTTG
TACTACCTGG CCAGCACCTA TGCGGAAAAA CAGGACTTCG AGCAGGCCAT CCGCAACTTC
CGGCTGATCG CACGGAGCAG CCCGCTCTGG CCCATGGCCC AGACCCGCCT CGCCTTGATT
TTCTCCAAGC AGAAGGACTT CCAGAACGGT GCGGCGGTCC TCAAGGAGGC GATCGACGCC
CAACCGGAAG TGGCCGATCT GTATCTCTAT CTCGGCATCA TCTATGAAGA AGCGAAACAA
TACGAGGATG GCGTCGCGGC GGTGGACAGG GGTCTGGTCA AGACTCCGCG AGACACCGAT
CTCCTGTTCC GAAAAGGGGT CATTCTCGAC AAGATGTCCC GAAGGGATGA CGCCATCGCC
GTGATGAAAC GGATCCTCGA GATCGAGCCT CAGAACGCCA ATGCCTTGAA CTACATCGGG
TACACGTATG CGGAGATGGG AATCAACCTC AACGAAGCCC GCCAGATGAT CAAGGCGGCC
CTGGCCACCG CTCCCGATGA TGGTTACATC ATGGACAGCC TCGCCTGGGT CTACTACAAG
CTGGGGCAAC ACAAGAAGGC TTTGGAGACC ATTCTGGAGG CATTGAAGCG CGTTCCGCAG
GACCCCGTCA TCCACGAGCA TCTCGGGGAC ATTTACTTGA GTCTCGGAAA GAAGAACGAA
GCCATCGAGG CCTATGAGAA AGCGCTGGAG TACAGCCACA CGGAGCCCGA GAAGATCCGG
GAGAAGCTGG ACCGGCTCAA GTAG
 
Protein sequence
MILRYLLIIA VLSVCSCTQL KPYAATSSTP KAPAARSESG SRADLYFEYL LAQYYVSVKE 
IDKAIEAYHE ALKKDPRSPM LLTELAALLI RQGKIEQALK LTEDATSFDR TYEPAYMLLG
QLYAGIGQNA RAIDAYSRAI EINPSNEDAH LLLGALYAQE KKYDEAMEAF DHLKALLPDN
PVALYYKARV FLDMKLYKQA EKIYLDVLAI EPAFENASLD LAYVYEVTER LKDAEQTYLQ
ILSANPANVN ARTRLGNLYM RQDRPAEALR HFSHLLKLNR KDVESRLKVG IIHLQQKDYE
EAIKDFTYLL KDEPQYDQAL YYLASTYAEK QDFEQAIRNF RLIARSSPLW PMAQTRLALI
FSKQKDFQNG AAVLKEAIDA QPEVADLYLY LGIIYEEAKQ YEDGVAAVDR GLVKTPRDTD
LLFRKGVILD KMSRRDDAIA VMKRILEIEP QNANALNYIG YTYAEMGINL NEARQMIKAA
LATAPDDGYI MDSLAWVYYK LGQHKKALET ILEALKRVPQ DPVIHEHLGD IYLSLGKKNE
AIEAYEKALE YSHTEPEKIR EKLDRLK
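
The sequences in this record are printed in space-separated blocks of ten characters, so the layout whitespace has to be removed before any analysis. As a minimal sketch (the helper names are hypothetical and not part of the source database), the following joins a block-formatted sequence and computes the GC percentage of a nucleotide string:

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as shown in this record.
first_line = clean_sequence(
    "ATGATTCTGC GATACCTTCT CATCATTGCA GTGCTTTCCG TGTGCAGTTG CACCCAGCTT"
)
print(len(first_line), round(gc_percent(first_line), 1))
```

Applied to the full gene sequence, clean_sequence should return a string of 1704 characters, matching the Gene Length field, and gc_percent can then be compared with the reported GC content.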