Gene Information |
Locus tag | GSU1066 |
Symbol | |
ID | 2688650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | + |
Start bp | 1151571 |
End bp | 1154615 |
Gene Length | 3045 bp |
Protein Length | 1014 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637125735 |
Product | hypothetical protein |
Protein accession | NP_952119 |
Protein GI | 39996168 |
COG category | [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3419] Tfp pilus assembly protein, tip-associated adhesin PilY1 |
TIGRFAM ID | |
|
|
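The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS length counts a terminal stop codon that is not translated; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1151571
end_bp = 1154615
gene_length_bp = 3045
protein_length_aa = 1014


def span_length(start, end):
    """Length of a closed, 1-based interval [start, end] (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one for the stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

print("span length matches Gene Length:", computed_length == gene_length_bp)
print("codon count matches Protein Length:", computed_protein == protein_length_aa)
```

Under these assumptions both comparisons hold for GSU1066: the span is 3045 bp, which is 1015 codons, one of which is the stop.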
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.797091 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACCG ACAATACGAC TCTCAGGAAA TTTGCCCTGC TGGCTGCGGC GATGGCCTTT GCCGGAATCG TCGCTTCCCT GGCACTGGCA GCGGTCTCGC AGATCCCTCT GTTCCTGCTC AATATCTCCC AGCCCAATGT CATGATCCTG CTCGACAACT CGGGCAGCAT GGACATCATC ATGCAGCACT CGGCGTTCGA TCCCACTGCC CGTTACAGCG GCGGGTTCGA CAACGATCGG ACCTACTATC AGACAACCAG TAACGGCTAT CACTATCTGT CGACCGGAAA CGACTATATC CGGGACGACA AAAAGGGCAA CTTCACTAAA AACAGCGTCA CCATCAAGCT TCCGCTTCCC TATGACGATA CCCGCTGGGA CGGGAACTAT CTCAACTGGC TGTTTTATCA TGCGACATCG AGCCAGCGCA GCACGGTGAG CACGGATGCA ACGCTCCAGA AAACCCGGAT CCAGACGGCG CGAGGGGTCA TCAGCAATCT GGTCAAGACC GTGTCAGGCG TCAGGTTCGG GCTGGCGAAA CTGAATGTGG ACGGCTATGA CAGGTTCGAC AGGAAGCAGA CTGACGGCGG GAGCATTGTC AGAAACTGCG GCGACCTGAC CTCGGCCAAC GTCGATACGA GCGTGAGCGG GATCAGCGCC GAAACCTGGA CCCCCCTGGG CGAGGCCTTG TCGGAGGTCT GGCAGTACTT CAAAGGGGGG ACCTCTCTCT ATAACACCGG CGTTTCCTAT ACGAGCCCCA TCACCTCCAG TTGCCAGAAG AGCTTCACCA TTGTGGTGAC CGATGGCGAG CCGACCTATG ACGGCTGCTA CCGTGGAGAC TTCAGCTCCT ACGGGTGCGA CAACGCCGCG GATGCCGACA GCCACCTGGC CGATGTGGCC GCCCATATGA ACGGCAGCGA CGCAACCTCA GCCTATGGCG GCACCCAGAG CGTCACCACC TACACCATTG GCATGACCAT TGACAGCAGC CTGCTCCGGA CCACCGCCGA GAACGGCGGC GGCTCCTACT ATACCACCAC GTCGGGCATG GACCTGGCAA CGGCTCTCCA GAACGCCGTC AACGAGATCC TCGGCAGACA GTCTTCCGCC AGCGCCGTGG CGGTCAGCAC CGCATATCTC ACCTCAAACA CCACGCTCTA CCGGGCCCGG TTCGATTCGA CCGACTGGAG CGGCTACCTG GAGGCCTACG GCATCAACAA GGCAAACGGT GCCGTTACCG GCTATCCCAA TTCGCCGAAG TGGGAGGCCG GGGCGCTGCT GAATGCCAAT TCCGCCCGCA CTGTCTATAC TGCCGGCGTC CAATCGGGAG TGTACCGGCG GGTCGACTTT ACCTCCACCA ATGCAGCCAC CCTTGCTCCT GCCGGGTTCA TGAATTTCTC GTCGGCAAGC ACCGCGTCCA TGATCGGCTA CGTGCGGGGC GACGTCGAAC CGGCCGGTTA TCGGCATCGG GCGAGCAAGC TCGGCGACAT GGTCCAGTCG GCTCCGGTCA TTCTCGGGCC GCCGGACGGT TACTACAGCG ACAACAACTA CGCCACGTTC AAGCGAAACA ACGCCACGCG CCAGTCGCTG ATCCTAGCCG GGGCCAACGA CGGGATGCTG CACGCGTTCA ATGCCGACAC CGGTGCCGAG GAGTGGGCCT TTATTCCGAA TATTCTCCTG CCCAAGCTGA AACTGCTCCG TGCCACCCCG TATACTCACA CCAACTACGT CAACGGCGCG ATCACGGTGG GCGATGCGTT CATTACCGCC AAAGGCCTCG ACGGCAAGTC CGAAACATCG 
TCTTCCTGGC GTACCATTGC GGTCTGCGGC CTGCGGGAGG GAGGCAAGGG CTATTTCGCC CTGGATGTGA CCGACGCCGC CAACCCGATT CCCCTCTGGG AGATCACCAA TACTTCACCA AGCGAAACGA GCGGTACAGT GGTGGGGCTG GGATACTCCT TCGGCACCCC CCTGATCGTC AAACTCAAGG ATAGTTCCCA GTCGGGCGGC TTCCGCTGGG TTGCGCTGCT GGCCAACGGT TACGAGGGGA CCACCTCGGG GCGTGCCGCC ACCCTGATTG TGGCTGACCT GGCAACGGGT GCGGTCATTC GGGAAATCGT CGCGGACGCG AGCACCTTCA GCGGCGTCTC GCCCAACGGC CTCGCCACGC CGGCCGCCAT TGACAGGGAT GCCGATGGCT TTGTGGACTA TGTGTATGCC GGAGATTTGA CCGGCCACCT CTGGAAGTTC GACCTCTCCA GCAGTAACAG CAACAACTGG GACGTGGTCT GGAAACGCTC GGGGACTCCC GTGGCCCTGT GCCGGGCGAA AACTGCCGCC GGCAGCGTCC AGCCGATCAC CACGGCCCCC GACGTGGTTC TGCGCGGCGG CTACCAGATC GTCTTTTTCG GTACCGGCAA ATACTATGAG TCCACGGACA TCTCATCTAC CCAGCCCCAG ACCTTTTATG GGGCCTACGA TTACAACAGC ACCACCACTC CCACCAGTGC CCAGGCCACT AACGGGGCCC TGCTCACCCG CGCCGACCTG ACCGCCCAGA CGGTTACCAG GATTGACGAG AGCGGAACCA GTTGGCGCAC CTCGTCGAAC AATCCGATCG GCCTGACCAA GGGATGGTAT CTGGACCTCC CCGTGGCCGG GGAGCGGGTG ATCACCGACC CGGTGGCCCG GTCGCGCAAG ATCATCTTCA CCACCTTCAT CCCCAATACC GATGCCTGCA GCTTCGGCGG GATCAGCTGG CTCATGGAAC TGAACATGGA CACCGGCGGG GAAGTGGTCA GGCCGGTATT CGATGTGAAT CTGGACGGGA AGGTGGATTA TAGCGACACG GTACTCGGAG ATCTCAAGGT GAAGCCCACC GGCACCCTCC TGGGGGACGG TCTCGCATCG ACGCCTGCCA TCGTGGGGGC CGGAGATGAG CACGAGTACA AGTACATCAC CAAGACCACG GGGGAAATCA TCAAACTGCT GGAGGGGGGC GGGCATAGCC AGATCGGCCT GCGAAGCTGG CGCCAGCTCA AGTGA
|
Protein sequence | MSTDNTTLRK FALLAAAMAF AGIVASLALA AVSQIPLFLL NISQPNVMIL LDNSGSMDII MQHSAFDPTA RYSGGFDNDR TYYQTTSNGY HYLSTGNDYI RDDKKGNFTK NSVTIKLPLP YDDTRWDGNY LNWLFYHATS SQRSTVSTDA TLQKTRIQTA RGVISNLVKT VSGVRFGLAK LNVDGYDRFD RKQTDGGSIV RNCGDLTSAN VDTSVSGISA ETWTPLGEAL SEVWQYFKGG TSLYNTGVSY TSPITSSCQK SFTIVVTDGE PTYDGCYRGD FSSYGCDNAA DADSHLADVA AHMNGSDATS AYGGTQSVTT YTIGMTIDSS LLRTTAENGG GSYYTTTSGM DLATALQNAV NEILGRQSSA SAVAVSTAYL TSNTTLYRAR FDSTDWSGYL EAYGINKANG AVTGYPNSPK WEAGALLNAN SARTVYTAGV QSGVYRRVDF TSTNAATLAP AGFMNFSSAS TASMIGYVRG DVEPAGYRHR ASKLGDMVQS APVILGPPDG YYSDNNYATF KRNNATRQSL ILAGANDGML HAFNADTGAE EWAFIPNILL PKLKLLRATP YTHTNYVNGA ITVGDAFITA KGLDGKSETS SSWRTIAVCG LREGGKGYFA LDVTDAANPI PLWEITNTSP SETSGTVVGL GYSFGTPLIV KLKDSSQSGG FRWVALLANG YEGTTSGRAA TLIVADLATG AVIREIVADA STFSGVSPNG LATPAAIDRD ADGFVDYVYA GDLTGHLWKF DLSSSNSNNW DVVWKRSGTP VALCRAKTAA GSVQPITTAP DVVLRGGYQI VFFGTGKYYE STDISSTQPQ TFYGAYDYNS TTTPTSAQAT NGALLTRADL TAQTVTRIDE SGTSWRTSSN NPIGLTKGWY LDLPVAGERV ITDPVARSRK IIFTTFIPNT DACSFGGISW LMELNMDTGG EVVRPVFDVN LDGKVDYSDT VLGDLKVKPT GTLLGDGLAS TPAIVGAGDE HEYKYITKTT GEIIKLLEGG GHSQIGLRSW RQLK
|
| |
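The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction and stop-codon check. The helper names are hypothetical, and the example inputs are only the first five groups and the final group of the gene sequence, not the full record, so the printed GC value describes that 50-base prefix and is not the record's 61% figure.

```python
# Stop codons of translation table 11 (same stops as the standard code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def join_groups(grouped):
    """Remove all whitespace from a sequence printed in 10-character groups."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq):
    """True if the last three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


# First five groups of the gene sequence, copied from the record.
prefix = join_groups("ATGAGCACCG ACAATACGAC TCTCAGGAAA TTTGCCCTGC TGGCTGCGGC")
# Final group of the gene sequence, copied from the record.
tail = join_groups("AGTGA")

print(len(prefix), prefix[:3])
print(round(gc_fraction(prefix), 2))
print(ends_with_stop(tail))
```

To check the record's GC content field, the same functions would be applied to the complete gene sequence pasted as a single string.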