Gene GSU1784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1784 
Symbol 
ID2686500 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1947687 
End bp1948895 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content54% 
IMG OID637126464 
Producttype IV pilus biogenesis protein PilC, putative 
Protein accessionNP_952834 
Protein GI39996883 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1459] Type II secretory pathway, component PulF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCCTAT ACCACTGTAA GCTCGGCTCC TCGGAGGGGC GAATCATCAC CAGGGAACTG 
GAAGCAGCAA ACCCTGAAAT GCTTCGCACC AGTTTGGAAG AGCAGGGTTT TTTTGTTTTC
GAGATAAAAA AAAAGCCCCT GCAGTTTCTC TGGGACAAAG GGGGAGGGCG ACGCAAGGTA
GACAACAAGG CCCTCCTTAC CCTCAACCAG GAGCTTCTCG TTCTGATCAA GGCCGGCTTG
CCGATTATCC AGGCGCTTGA CACGGTACTG GAGCGGGTTG AGCGGGGAAC TCTATTTGAC
GTGCTGGCGG TTGTTCGCGA GGACGTAAAG GGTGGGATGG CTCTCTCCGA TGCCCTGGAA
AAGCACACCA AGGTCTTTCC CCATCTCTAC GTAGCTTCGG TGCGAGCTGG CGAGCGGACC
GGGGACCTGC AGCTCACCAT CCGGCGCTAC ATCGCGTTTC TCAAACGGGT GGAAGAAGTA
CGGAAACGAT TCATTTCCGC GCTGGTCTAT CCCGCAATCC TTGTTACCGT TGCAACGTTG
GCCATCACTT TTCTCCTGGT CTACGTGGTT CCGACCTTCA GCCAGGTTTA TGCGGATGCC
GGTTCCCAGC TTCCACTTCC CACCAGGATA CTCATAGCGT TTTCCACATC ATTGAAGCAG
CTTTTCCCCC TGATTATAGC GGCAGTTATC GGTGCTGTAT TCTTTTTCAG GCGATGGGCC
GCTACCGAGA GCGGGCGGTA TCGGGTTGAT GATATAAAGA TACGAATACC CTTTATCGGT
GATGTCTTTT CCAAATTCGC CGTCAGCTCG TTTACACGGA CATTGGCAAC AGTCATCGGT
AGCGGCATAC CTATTGTCGA ATCGCTCAAG ATGTCGGTGG GCACATTGAA TAATCGCGTG
CTCGAAAGAC GGATGCTCGA AGCCGTGGTC AAGATCGAGG AGGGAATGAG TCTGTCGGGT
GCCATCGAGT CGGCGAGGAT CATGCCGCCT CTCGCGCTCC GGATGCTCGG CGTGGGGGAG
TCGACCGGTT CCCTTGAGGA GATGCTGAGT GATATTGCCG AATACTTCGA GGGGGAAATC
GATGCCCGCC TCCACCTGCT GACAACTGCC ATTGAACCCG CGATCATGAT CGTCATGGGG
CTGGTTGTTG GCGTTATTAT CGTAACCATG TACCTGCCGG TGTTCAAAAT TGCCGGCACT
GTAGGATGA
 
Protein sequence
MALYHCKLGS SEGRIITREL EAANPEMLRT SLEEQGFFVF EIKKKPLQFL WDKGGGRRKV 
DNKALLTLNQ ELLVLIKAGL PIIQALDTVL ERVERGTLFD VLAVVREDVK GGMALSDALE
KHTKVFPHLY VASVRAGERT GDLQLTIRRY IAFLKRVEEV RKRFISALVY PAILVTVATL
AITFLLVYVV PTFSQVYADA GSQLPLPTRI LIAFSTSLKQ LFPLIIAAVI GAVFFFRRWA
ATESGRYRVD DIKIRIPFIG DVFSKFAVSS FTRTLATVIG SGIPIVESLK MSVGTLNNRV
LERRMLEAVV KIEEGMSLSG AIESARIMPP LALRMLGVGE STGSLEEMLS DIAEYFEGEI
DARLHLLTTA IEPAIMIVMG LVVGVIIVTM YLPVFKIAGT VG