Gene GSU1066 details


Gene Information

Locus tag: GSU1066
Symbol:
ID: 2688650
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 1151571
End bp: 1154615
Gene Length: 3045 bp
Protein Length: 1014 aa
Translation table: 11
GC content: 61%
IMG OID: 637125735
Product: hypothetical protein
Protein accession: NP_952119
Protein GI: 39996168
COG category: [N] Cell motility
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3419] Tfp pilus assembly protein, tip-associated adhesin PilY1
TIGRFAM ID:
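
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1151571
END_BP = 1154615
GENE_LENGTH_BP = 3045
PROTEIN_LENGTH_AA = 1014


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons agree with the record: 1154615 - 1151571 + 1 = 3045 bp, and 3045 / 3 - 1 = 1014 aa.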


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.797091
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCACCG ACAATACGAC TCTCAGGAAA TTTGCCCTGC TGGCTGCGGC GATGGCCTTT 
GCCGGAATCG TCGCTTCCCT GGCACTGGCA GCGGTCTCGC AGATCCCTCT GTTCCTGCTC
AATATCTCCC AGCCCAATGT CATGATCCTG CTCGACAACT CGGGCAGCAT GGACATCATC
ATGCAGCACT CGGCGTTCGA TCCCACTGCC CGTTACAGCG GCGGGTTCGA CAACGATCGG
ACCTACTATC AGACAACCAG TAACGGCTAT CACTATCTGT CGACCGGAAA CGACTATATC
CGGGACGACA AAAAGGGCAA CTTCACTAAA AACAGCGTCA CCATCAAGCT TCCGCTTCCC
TATGACGATA CCCGCTGGGA CGGGAACTAT CTCAACTGGC TGTTTTATCA TGCGACATCG
AGCCAGCGCA GCACGGTGAG CACGGATGCA ACGCTCCAGA AAACCCGGAT CCAGACGGCG
CGAGGGGTCA TCAGCAATCT GGTCAAGACC GTGTCAGGCG TCAGGTTCGG GCTGGCGAAA
CTGAATGTGG ACGGCTATGA CAGGTTCGAC AGGAAGCAGA CTGACGGCGG GAGCATTGTC
AGAAACTGCG GCGACCTGAC CTCGGCCAAC GTCGATACGA GCGTGAGCGG GATCAGCGCC
GAAACCTGGA CCCCCCTGGG CGAGGCCTTG TCGGAGGTCT GGCAGTACTT CAAAGGGGGG
ACCTCTCTCT ATAACACCGG CGTTTCCTAT ACGAGCCCCA TCACCTCCAG TTGCCAGAAG
AGCTTCACCA TTGTGGTGAC CGATGGCGAG CCGACCTATG ACGGCTGCTA CCGTGGAGAC
TTCAGCTCCT ACGGGTGCGA CAACGCCGCG GATGCCGACA GCCACCTGGC CGATGTGGCC
GCCCATATGA ACGGCAGCGA CGCAACCTCA GCCTATGGCG GCACCCAGAG CGTCACCACC
TACACCATTG GCATGACCAT TGACAGCAGC CTGCTCCGGA CCACCGCCGA GAACGGCGGC
GGCTCCTACT ATACCACCAC GTCGGGCATG GACCTGGCAA CGGCTCTCCA GAACGCCGTC
AACGAGATCC TCGGCAGACA GTCTTCCGCC AGCGCCGTGG CGGTCAGCAC CGCATATCTC
ACCTCAAACA CCACGCTCTA CCGGGCCCGG TTCGATTCGA CCGACTGGAG CGGCTACCTG
GAGGCCTACG GCATCAACAA GGCAAACGGT GCCGTTACCG GCTATCCCAA TTCGCCGAAG
TGGGAGGCCG GGGCGCTGCT GAATGCCAAT TCCGCCCGCA CTGTCTATAC TGCCGGCGTC
CAATCGGGAG TGTACCGGCG GGTCGACTTT ACCTCCACCA ATGCAGCCAC CCTTGCTCCT
GCCGGGTTCA TGAATTTCTC GTCGGCAAGC ACCGCGTCCA TGATCGGCTA CGTGCGGGGC
GACGTCGAAC CGGCCGGTTA TCGGCATCGG GCGAGCAAGC TCGGCGACAT GGTCCAGTCG
GCTCCGGTCA TTCTCGGGCC GCCGGACGGT TACTACAGCG ACAACAACTA CGCCACGTTC
AAGCGAAACA ACGCCACGCG CCAGTCGCTG ATCCTAGCCG GGGCCAACGA CGGGATGCTG
CACGCGTTCA ATGCCGACAC CGGTGCCGAG GAGTGGGCCT TTATTCCGAA TATTCTCCTG
CCCAAGCTGA AACTGCTCCG TGCCACCCCG TATACTCACA CCAACTACGT CAACGGCGCG
ATCACGGTGG GCGATGCGTT CATTACCGCC AAAGGCCTCG ACGGCAAGTC CGAAACATCG
TCTTCCTGGC GTACCATTGC GGTCTGCGGC CTGCGGGAGG GAGGCAAGGG CTATTTCGCC
CTGGATGTGA CCGACGCCGC CAACCCGATT CCCCTCTGGG AGATCACCAA TACTTCACCA
AGCGAAACGA GCGGTACAGT GGTGGGGCTG GGATACTCCT TCGGCACCCC CCTGATCGTC
AAACTCAAGG ATAGTTCCCA GTCGGGCGGC TTCCGCTGGG TTGCGCTGCT GGCCAACGGT
TACGAGGGGA CCACCTCGGG GCGTGCCGCC ACCCTGATTG TGGCTGACCT GGCAACGGGT
GCGGTCATTC GGGAAATCGT CGCGGACGCG AGCACCTTCA GCGGCGTCTC GCCCAACGGC
CTCGCCACGC CGGCCGCCAT TGACAGGGAT GCCGATGGCT TTGTGGACTA TGTGTATGCC
GGAGATTTGA CCGGCCACCT CTGGAAGTTC GACCTCTCCA GCAGTAACAG CAACAACTGG
GACGTGGTCT GGAAACGCTC GGGGACTCCC GTGGCCCTGT GCCGGGCGAA AACTGCCGCC
GGCAGCGTCC AGCCGATCAC CACGGCCCCC GACGTGGTTC TGCGCGGCGG CTACCAGATC
GTCTTTTTCG GTACCGGCAA ATACTATGAG TCCACGGACA TCTCATCTAC CCAGCCCCAG
ACCTTTTATG GGGCCTACGA TTACAACAGC ACCACCACTC CCACCAGTGC CCAGGCCACT
AACGGGGCCC TGCTCACCCG CGCCGACCTG ACCGCCCAGA CGGTTACCAG GATTGACGAG
AGCGGAACCA GTTGGCGCAC CTCGTCGAAC AATCCGATCG GCCTGACCAA GGGATGGTAT
CTGGACCTCC CCGTGGCCGG GGAGCGGGTG ATCACCGACC CGGTGGCCCG GTCGCGCAAG
ATCATCTTCA CCACCTTCAT CCCCAATACC GATGCCTGCA GCTTCGGCGG GATCAGCTGG
CTCATGGAAC TGAACATGGA CACCGGCGGG GAAGTGGTCA GGCCGGTATT CGATGTGAAT
CTGGACGGGA AGGTGGATTA TAGCGACACG GTACTCGGAG ATCTCAAGGT GAAGCCCACC
GGCACCCTCC TGGGGGACGG TCTCGCATCG ACGCCTGCCA TCGTGGGGGC CGGAGATGAG
CACGAGTACA AGTACATCAC CAAGACCACG GGGGAAATCA TCAAACTGCT GGAGGGGGGC
GGGCATAGCC AGATCGGCCT GCGAAGCTGG CGCCAGCTCA AGTGA
 
Protein sequence
MSTDNTTLRK FALLAAAMAF AGIVASLALA AVSQIPLFLL NISQPNVMIL LDNSGSMDII 
MQHSAFDPTA RYSGGFDNDR TYYQTTSNGY HYLSTGNDYI RDDKKGNFTK NSVTIKLPLP
YDDTRWDGNY LNWLFYHATS SQRSTVSTDA TLQKTRIQTA RGVISNLVKT VSGVRFGLAK
LNVDGYDRFD RKQTDGGSIV RNCGDLTSAN VDTSVSGISA ETWTPLGEAL SEVWQYFKGG
TSLYNTGVSY TSPITSSCQK SFTIVVTDGE PTYDGCYRGD FSSYGCDNAA DADSHLADVA
AHMNGSDATS AYGGTQSVTT YTIGMTIDSS LLRTTAENGG GSYYTTTSGM DLATALQNAV
NEILGRQSSA SAVAVSTAYL TSNTTLYRAR FDSTDWSGYL EAYGINKANG AVTGYPNSPK
WEAGALLNAN SARTVYTAGV QSGVYRRVDF TSTNAATLAP AGFMNFSSAS TASMIGYVRG
DVEPAGYRHR ASKLGDMVQS APVILGPPDG YYSDNNYATF KRNNATRQSL ILAGANDGML
HAFNADTGAE EWAFIPNILL PKLKLLRATP YTHTNYVNGA ITVGDAFITA KGLDGKSETS
SSWRTIAVCG LREGGKGYFA LDVTDAANPI PLWEITNTSP SETSGTVVGL GYSFGTPLIV
KLKDSSQSGG FRWVALLANG YEGTTSGRAA TLIVADLATG AVIREIVADA STFSGVSPNG
LATPAAIDRD ADGFVDYVYA GDLTGHLWKF DLSSSNSNNW DVVWKRSGTP VALCRAKTAA
GSVQPITTAP DVVLRGGYQI VFFGTGKYYE STDISSTQPQ TFYGAYDYNS TTTPTSAQAT
NGALLTRADL TAQTVTRIDE SGTSWRTSSN NPIGLTKGWY LDLPVAGERV ITDPVARSRK
IIFTTFIPNT DACSFGGISW LMELNMDTGG EVVRPVFDVN LDGKVDYSDT VLGDLKVKPT
GTLLGDGLAS TPAIVGAGDE HEYKYITKTT GEIIKLLEGG GHSQIGLRSW RQLK
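
The relationship between the two sequences above can be illustrated with a minimal translation sketch. It is not the tool the database used; it assumes the gene sequence is given 5' to 3' in the coding frame, and it uses the standard codon assignments, which translation table 11 shares (table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG). The names `clean_blocks`, `translate`, and `gc_percent` are hypothetical helpers.

```python
# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_blocks(text):
    """Join the 10-character blocks used in this record into one string."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean_blocks(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean_blocks(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from this record.
first_line = "ATGAGCACCG ACAATACGAC TCTCAGGAAA TTTGCCCTGC TGGCTGCGGC GATGGCCTTT"
print(translate(first_line))  # MSTDNTTLRKFALLAAAMAF
```

The printed residues match the first two blocks of the protein sequence (MSTDNTTLRK FALLAAAMAF). Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field reported above.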