Gene GSU1491 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1491 
Symbol 
ID2686142 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1634843 
End bp1636549 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content59% 
IMG OID637126167 
Producttype IV pilus biogenesis protein PilB 
Protein accessionNP_952542 
Protein GI39996591 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID[TIGR02533] general secretory pathway protein E
[TIGR02538] type IV-A pilus assembly ATPase PilB 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.10563 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGGCTA GCAGACTGGG AGAACTCCTG GTTCGAAACA ATATCATCAC CAAGGAACAG 
CTTGCCAAGG CACTGGACGA GCAACGAACC TCCGGCGGCC AACAGCGCCT TGGCTCCATC
CTCGTCAAGA ACGGCCTTGT CACTGAGCCC GATCTCACCA CGTTCCTCTC CAAGCAGTAC
GGTGTCCCCT CCATCAATCT GAGTGAATTC GAAGCCGATA TGGCGGTGGT CAAGATCATC
CCCGCCGACG TGGCCCAGAA GTACCAGATC GTCCCGGTGA ACAGGGCCGG TTCGACCCTG
ATCATTGCCA TGGCTGACCC GTCGAACATC TTCGCCATCG ACGACATCAA GTTCATGACC
GGCTACAACG TGGAGGTGGT CGTCGCTTCC GAGTCGTCCA TCAAGACTGC CATCGACAAG
TACTATGACC AGTCAGCGTC TCTGGCCGAC GTCATGAACG ACCTGGAGAT GGACGATCTG
GAGGTTATCG GCGAGGACGA GGATGTAGAC GTTTCTTCCC TCGAACGGGC GACCGAGGAT
GCGCCGGTCG TCAAGCTCGT GAACCTGATC CTCACCGACG CCATCAAGAA GAAGGCGAGT
GATATCCATA TCGAGCCCTA TGAGCGGACC TTCCGGGTGC GCTATCGGAT CGACGGCGTC
CTCTACGAAG TCATGAAGCC CCCCCTGAAA CTGAAAAACG CCATCACGTC GCGGATCAAG
ATCATGGCTG ACCTGGATAT TGCCGAACGG CGGCTTCCCC AGGACGGCCG CATCAAGATC
AAAATGGGTG GCGGGCAGGA CATGGACTAC CGGGTGTCGG TCCTGCCGAC CCTGTTCGGC
GAGAAGGTGG TTCTGCGACT CCTGGACAAG TCGAACCTCC AGCTCGACAT GACCAAGCTG
GGCTACGAGC CCACGGCGTT GAGCTACTTC AAGGAGGCGA TTCACAAGCC CTTCGGCATG
GTGCTGGTGA CCGGACCCAC GGGGAGCGGC AAGACGGTTT CCCTCTATTC GGCGCTTTCC
GAGCTCAACA AGACTACCGA GAACATCTCC ACGGCCGAAG ACCCGGTGGA GTTCAACTTC
GCCGGCATCA ACCAGGTGCA GATGCACGAA GATATCGGTC TTACCTTTGC CGCCGCGCTC
CGCTCCTTCC TGCGTCAGGA CCCGGACATC ATCATGATCG GAGAGATCCG GGACTTCGAG
ACGGCCGAGA TCGCCATCAA GGCTGCGCTT ACCGGTCACT TGGTTCTCTC TACCCTTCAC
ACCAACGATG CCCCGGCCAC CATCAACCGG CTGTTGAACA TGGGGGTCGA GCCGTTTCTT
GTTGCCTCGG CGGTGAACCT GATTACCGCC CAGCGTCTTG CCCGGCGGGT CTGCTCCGAG
TGCAAGGCCG TGGAGGAGAT ACCGATCCAG GCTTTGATCG ATGCAGGCGT CCCTCCCGAG
GAAGCTCCTG AATATGTCTG CTTTCGCGGC ACCGGTTGTG CCAAGTGCAA CAACACCGGC
TACAAGGGGC GCGTCGGCTT CTATCAGGTA ATGCCCATGC TGGAGGAAAT CAGGGAGCTG
ATTCTCAACG GCGCCAATAC GGCCGAAATC AAACGCGAAT CCATGCGCCT GGGCATCAAG
ACCATGCGCC AATCGGGCCT GACCAAACTC AAGGAGGGGG TTACCTCCTT TGAGGAGGTG
CTACGGGTTA CCGTGGCTGA CGACTAA
 
Protein sequence
MQASRLGELL VRNNIITKEQ LAKALDEQRT SGGQQRLGSI LVKNGLVTEP DLTTFLSKQY 
GVPSINLSEF EADMAVVKII PADVAQKYQI VPVNRAGSTL IIAMADPSNI FAIDDIKFMT
GYNVEVVVAS ESSIKTAIDK YYDQSASLAD VMNDLEMDDL EVIGEDEDVD VSSLERATED
APVVKLVNLI LTDAIKKKAS DIHIEPYERT FRVRYRIDGV LYEVMKPPLK LKNAITSRIK
IMADLDIAER RLPQDGRIKI KMGGGQDMDY RVSVLPTLFG EKVVLRLLDK SNLQLDMTKL
GYEPTALSYF KEAIHKPFGM VLVTGPTGSG KTVSLYSALS ELNKTTENIS TAEDPVEFNF
AGINQVQMHE DIGLTFAAAL RSFLRQDPDI IMIGEIRDFE TAEIAIKAAL TGHLVLSTLH
TNDAPATINR LLNMGVEPFL VASAVNLITA QRLARRVCSE CKAVEEIPIQ ALIDAGVPPE
EAPEYVCFRG TGCAKCNNTG YKGRVGFYQV MPMLEEIREL ILNGANTAEI KRESMRLGIK
TMRQSGLTKL KEGVTSFEEV LRVTVADD