Gene Sfum_0540 details

Gene Information

Locus tag: Sfum_0540
Symbol:
ID: 4460578
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Syntrophobacter fumaroxidans MPOB
Kingdom: Bacteria
Replicon accession: NC_008554
Strand:
Start bp: 658509
End bp: 660419
Gene Length: 1911 bp
Protein Length: 636 aa
Translation table: 11
GC content: 59%
IMG OID: 639701296
Product: type IV pilus secretin PilQ
Protein accession: YP_844674
Protein GI: 116747987
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4796] Type II secretory pathway, component HofQ
TIGRFAM ID: [TIGR02515] type IV pilus secretin (or competence protein) PilQ
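
The coordinate and length fields above are consistent with one another, and a short check makes the arithmetic explicit. This is a minimal sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 658509
end_bp = 660419

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1911, matching "Gene Length"
print(protein_length)  # 636, matching "Protein Length"
```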


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.614076
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.439492
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATTACG TCGGTTTTCG AAAAATGAGC TGTTTCGCTT TTGTATTGGT GTTTGCCTTG 
TGCTGGGCTT GTGCGATTCC CGGCTCCGTC GAAGGCGGAT CCAGCCCGAA AGCCGGTGTT
GCGAGCCCGA GCCCTGCGGC AGCCACGCCG ACGGCCGAAT CTGCAACGGC GGGAAAAGGC
GCGAAGTTGT TGAGTGTCGC TATGGACGAA TCCGGGTTGG AGACGGTGCT CAGGATAGCG
GGCAGCGGCC CCTTGAAGGA TTATCAATTC CGGCGCCTGG GCGAAGACAG ATTCGTGCTG
GAAATCGGGG ATGTGACCGC CAGGGCGGCC AAGCCCGCGC TGCCCGGCAA GTCGAGCAGG
GTCAAGCTCA ACTATGCGCG AGCGAAGGCG GGGGTCCGGA TCGTCGGGAA TGTCCGGCAG
CCCATCAGTG ATTATACCGT CGACAGTGTC GACAACGACA TTGTTGTGAA CCTGCGTTTC
GCGGAAAAGA AGCAGGTTGC GGCGGCCGGC GCCGCTGCGG AGAAACCTGC CACGGCGGAA
AAGACCAGGC CGGGCAAGGC TGCGGCGCGC CGTGGAGCCG GCGATGCCGG GGAGGAAGTC
CAGGGGGCGG ACCAGCAGTT CCCCGAATAT TCAGCCACCA GGCCGGGGGG GGGAGCCCCC
GAATTGCGTC ATGCCAGGAA GCAGTACACG GGCAAACCGA TCAGCCTTGA TCTGCTCGAC
GCCGATCTGC GGAACGTCCT GCGGCTTCTG GCCGATCTGA CCGGGACGAA CATTGTGATC
GAACCGGACG TCACGGGCAA GGTGACCCTG AAGGTGGAAC AGGTCCCATG GGATCAGGTT
CTGGACATGA TCATATCCAT GAACGATCTG GGACAGGAGC AGGTCGGGAG CGTGATTCGC
ATCGCCAGGC GGAGCAAGCT CAAAAGCGAA TGGGCCCAGC AGGCGGAAGC CATCAGGGCC
AAGCAGGAAT ACCTGTTGAC CAGCAAGGAT CTGGGAGAGA TCAATACGGC ATATCTTACG
GTGAACTACG CCCAGGTGAC CGATGTCGCC AGCAAGATCA ACGAGGCGAA GAGCGATAAG
GGCAAGGTAT CCGTCGACGA GCGGACGAGC CTGATCATTT ACAGCGACTA TCCGGGCCGT
ATCAACAACG CCAGGATGCT TCTGAACAGG CTTGACCGGC CCACTTCCCA GGTGCTCATC
GAAGCCCGCA TCATCACGTT GACTTCGGAA GTCAAACGGT CCCTCGGCAT GAACCTGGGC
TTCGGCGGCG ATACCCCGAA TCATTCGGCG ACCGTGCCGT TCACGGATTT TCTGATCAAC
AGCCCGCCGG CCAATCTGTT CGCTCTGAAC CTTGCGCAGA TGGTCGGTAC GACGCTGCTG
AAAGTGGACC TCACTATTTC GGCCCTCGAA ACCGCCGACG AGATTCGCAT CATGGCGGCT
CCGAGAGTTC TGACCATGAA CAACGTCAAG GCAGTGATCT CCCAGGGTGT GCAAATTCCC
TACCTGAAGG TCGGCGATAC GGCATCCAAC ATCACGGGCA CGGATTTCAA GGACGCCGTG
CTCGAGCTCG CCGTGACGCC GCATATCACT CCTGACCACA AGGTGCGGAT GACCATCGAG
GCAAAACAGG ACGAACCCTC GAGCACCGTA ACCGGAGCGC AGGGGCAGCC CGGTATCGAT
ACGAGAAAGA TTTCCACGGA ACTGCTGGTG GATGACGGCA ACATCATTGT GATCGGCGGC
ATCATCCGAA ATCGGGATGA AGCCAAGAAG ACGGCCACGC CCGGACTCAG CGACGTCCCC
ATTCTGGGCA GGCTGTTCAA ATCAAACGAG GTCGATGCGC AAAGAAACGA AATTTTGATT
TTCATCTGCC CGAAAATTGT GGATGTGACG AAACCTTCTG ATCGCACATA G
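
The gene sequence above is displayed in space-separated groups of ten bases, so it needs to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC percentage comparable to the "GC content" field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first displayed line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence shown in grouped blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGCATTACG TCGGTTTTCG AAAAATGAGC TGTTTCGCTT TTGTATTGGT GTTTGCCTTG"
seq = clean_sequence(first_line)

print(len(seq))                    # 60 bases on one displayed line
print(round(gc_percent(seq), 1))   # GC percentage of this line only
```

Running the same two helpers over all lines of the gene sequence would give the length and GC percentage for the whole gene, which can then be compared with the values listed under Gene Information.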
 
Protein sequence
MHYVGFRKMS CFAFVLVFAL CWACAIPGSV EGGSSPKAGV ASPSPAAATP TAESATAGKG 
AKLLSVAMDE SGLETVLRIA GSGPLKDYQF RRLGEDRFVL EIGDVTARAA KPALPGKSSR
VKLNYARAKA GVRIVGNVRQ PISDYTVDSV DNDIVVNLRF AEKKQVAAAG AAAEKPATAE
KTRPGKAAAR RGAGDAGEEV QGADQQFPEY SATRPGGGAP ELRHARKQYT GKPISLDLLD
ADLRNVLRLL ADLTGTNIVI EPDVTGKVTL KVEQVPWDQV LDMIISMNDL GQEQVGSVIR
IARRSKLKSE WAQQAEAIRA KQEYLLTSKD LGEINTAYLT VNYAQVTDVA SKINEAKSDK
GKVSVDERTS LIIYSDYPGR INNARMLLNR LDRPTSQVLI EARIITLTSE VKRSLGMNLG
FGGDTPNHSA TVPFTDFLIN SPPANLFALN LAQMVGTTLL KVDLTISALE TADEIRIMAA
PRVLTMNNVK AVISQGVQIP YLKVGDTASN ITGTDFKDAV LELAVTPHIT PDHKVRMTIE
AKQDEPSSTV TGAQGQPGID TRKISTELLV DDGNIIVIGG IIRNRDEAKK TATPGLSDVP
ILGRLFKSNE VDAQRNEILI FICPKIVDVT KPSDRT
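
The protein sequence can be checked against the gene sequence by translating codon by codon. The following is a minimal sketch under stated assumptions: it uses the amino-acid assignments of the standard genetic code, which translation table 11 shares (the two tables differ in which start codons are permitted), and it stops at the first stop codon. The function name `translate` and the table-building approach are illustrative, not taken from the record.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard code's amino acids.
# Assumption: table 11 uses these same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# Start and end of the gene sequence, copied from the record.
print(translate("ATGCATTACG TCGGTTTTCG A"))  # MHYVGFR
print(translate("GATCGCACAT AG"))            # DRT
```

The two fragments reproduce the first seven residues (MHYVGFR) and the last three residues (DRT) of the protein sequence shown above.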