Gene GSU3191 details

Gene Information

Locus tag: GSU3191
Symbol:
ID: 2688377
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 3496790
End bp: 3498706
Gene Length: 1917 bp
Protein Length: 638 aa
Translation table: 11
GC content: 59%
IMG OID: 637127884
Product: TPR domain-containing protein
Protein accession: NP_954232
Protein GI: 39998281
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG5010] Flp pilus assembly protein TadD, contains TPR repeats
TIGRFAM ID:
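
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length(3496790, 3498706)
print(length, "bp")                    # gene length
print(protein_length(length), "aa")    # protein length
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.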


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAAAAAT ACCTTACACT CTTCGTGCTT TCCCTGTCGG CCGCGACCGC GCTGGGGGGG 
GAGGCAGTTG CCGCCCAGGA GCGGTGCTGG GAGGCGAAGA GGCTTGTCAA ATCACTTTCG
GGCGAGGATG AGGACGCCCG GGCGAATACT GAAAAACGAA TCCTCGAATA TTGCTCCGAC
GGCGCTGCCG GCCATTTCGT CAAGGGGCTC CGGGCCGAGC GTGAAAAACG CTTCGACCAG
GCGGTGACGG AATATCGCGA AACGCTCCGG ATGGACCCTT CGTTTCCCGA TGCCCAAGGA
AGTCTCGGCC TGGCCCTGCT GGCCAAGGGT TCCGCCGACG AGGCCGTGGT GGAACTGACC
CGCGCCCTGG CCGAAGATTC CAAGCCCCGC TATCATCAGG GCCTCGGCAA AATTTTTGCT
GAACGCTCCC TCTATTCCCT AGCCCTTTAT CATTATGCCG AAGCCCTGCG CGAACTCCCC
GACGACCCGT CGATTCGTGT GGACCTCGCA CAGGTCTACC GGCAGGCAGG CAAAAAGGAA
GAGGCGGAGA AGGAGCTGAA AAAGGCCCTC ACCGTTGCTC CGGCCAATGA AAATGCCCGC
CTTGCCCTCG CATCTCTCTA TCTGGCCGAT GGCCGCACCG AAGGCGCGGT CCAGGAACTC
AAGCAGGCCC AACTGGCCAA TCCCGGCAAC CGCGGCATCC ACCTCCTCCT TGCAGAAGCA
TATGAAAAAC TTGGTGATCG CAAGGCCGCC GAGTACGAAT ACACCCTTTC CGGGCGCCAA
CGAGGCGTCC TTCCCGAAGA ATACCTCCGT CGGGGTGACG AACGGATGGC AGCCAAAGAG
TTCCCCAAGG CCGTGGAGGA GTACCGTGCT GCCCTGAAAG AGCGTCCCGG GTCCGCGGAG
GTGCTGCATA AGCTGTCCGG TGCCCAGGCA GCCGCGGGGC TCGATGACGA TGCCATTGCA
TCCTACCGCG AACTGCTGCG AGTAAAGCCC GGCAATGCCG CTAATCATTA CAACCTCGGC
ATTATCTACG AGCGCAAGGG GCTCATCGAC GAGGCCGTGG TGGAATACAA ACAGGCGGTT
CGACTCTCTG CCGAGCACGG TGACGCCCGG CGCCGACTCG CCGATATCTA CACCCTGCGC
GGCAGCCATC CCCAAGCCAT TGAGCAGTAC CGCGAACTCC TCAAGCGGGG CGACAGCAAT
CCCGTCCTCC ATCTCAAGCT TGCCCGCGGT TTCATGTCCT CGAAGAACAC CAAGGATGCC
ATAGCCTCTT ACAACGAGGC CCTCAAACTG GACCCGGACA ACCTGGAAGC CCATCGGGAA
CTCGCTGCCG TCTACCGCAA GTTGAACCAG ATGGACGACG CGTCCAAGCA GTATCGCGAG
GTGCTCCGCA TCAAAAAAGA CGATGCCGAG GCCCGCAACA TCCTGACAGC TATTTACGTC
AAGGAGAAGA AATACGATGA ACTGGTCCCC CTTCTCCAGG AAGGGGTCGA ATTGGCTCCC
AACGATGCCA TGAGCCACTA CAAGCTCGGG CTCATTCATG AATTCCGCAA GGACTACGAT
TCAGCCGAGG TGAGCTACCG GAAAGCGACC GAACTCAAGG ATGATCACGC AAAGGCTCTC
AACGCCCTGG GGCGCATCTA TCTGAAAACC GGCAAACTTA CCGAGGCCAA GGAGGCCCTG
GAAGCGGCCA AAAAGGCCGA CCCTGGCATG GAAGAGACGG CTGTTCTTCT CAGCAACATC
AAGGATGAGC TGAGCCCGGA GCCGAAGCGG TACGTGAAGA AGAAAGGGAC CAAAGGGCGC
AAGTCCGCCG TTTCCAAGAA GAGGTCCGGC AAGAAAAGCT CCGCCAAGAA GAGTACAACG
AAAAAGAAAT CGACGAAAAA GAGTTCGTCG TCCGGAAAGA AAAAAAGCGG CACATAA
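
The gene sequence above is displayed in space-separated blocks of 10 bases. A minimal sketch, assuming the text is pasted as-is, shows how the blocks might be joined into one contiguous string and how a GC percentage could be computed from it. The helper names `join_blocks` and `gc_percent` are hypothetical.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "GTGAAAAAAT ACCTTACACT CTTCGTGCTT TCCCTGTCGG CCGCGACCGC GCTGGGGGGG"
seq = join_blocks(first_line)
print(len(seq))         # number of bases on that line
print(gc_percent(seq))  # GC percentage of that line only
```

Running `join_blocks` on the full sequence and checking that its length matches the Gene Length field is a quick way to confirm that nothing was lost when copying.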
 
Protein sequence
MKKYLTLFVL SLSAATALGG EAVAAQERCW EAKRLVKSLS GEDEDARANT EKRILEYCSD 
GAAGHFVKGL RAEREKRFDQ AVTEYRETLR MDPSFPDAQG SLGLALLAKG SADEAVVELT
RALAEDSKPR YHQGLGKIFA ERSLYSLALY HYAEALRELP DDPSIRVDLA QVYRQAGKKE
EAEKELKKAL TVAPANENAR LALASLYLAD GRTEGAVQEL KQAQLANPGN RGIHLLLAEA
YEKLGDRKAA EYEYTLSGRQ RGVLPEEYLR RGDERMAAKE FPKAVEEYRA ALKERPGSAE
VLHKLSGAQA AAGLDDDAIA SYRELLRVKP GNAANHYNLG IIYERKGLID EAVVEYKQAV
RLSAEHGDAR RRLADIYTLR GSHPQAIEQY RELLKRGDSN PVLHLKLARG FMSSKNTKDA
IASYNEALKL DPDNLEAHRE LAAVYRKLNQ MDDASKQYRE VLRIKKDDAE ARNILTAIYV
KEKKYDELVP LLQEGVELAP NDAMSHYKLG LIHEFRKDYD SAEVSYRKAT ELKDDHAKAL
NALGRIYLKT GKLTEAKEAL EAAKKADPGM EETAVLLSNI KDELSPEPKR YVKKKGTKGR
KSAVSKKRSG KKSSAKKSTT KKKSTKKSSS SGKKKSGT
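
The gene sequence begins with GTG while the protein begins with M. Translation table 11 uses the standard codon-to-amino-acid assignments but allows alternative start codons such as GTG, which are read as methionine at the initiation position. The sketch below illustrates this on the first 30 bases of the gene; the `translate` function and the reduced set of start codons are simplifying assumptions, not the database's own method.

```python
from itertools import product

BASES = "TCAG"
# Standard amino acid assignments in NCBI TCAG order (shared by table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: only the most common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(seq: str, first_is_start: bool = True) -> str:
    """Translate a CDS fragment, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and first_is_start and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("GTGAAAAAAT ACCTTACACT CTTCGTGCTT"))
```

The result can be compared with the first block of the protein sequence. Passing `first_is_start=False` translates an internal fragment, where GTG would be read as valine.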