Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU1945 |
Symbol | |
ID | 2685513 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 2130067 |
End bp | 2133210 |
Gene Length | 3144 bp |
Protein Length | 1047 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637126636 |
Product | fibronectin type III domain-containing protein |
Protein accession | NP_952994 |
Protein GI | 39997043 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.771273 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACCC GCACGCTAAC AACTCTTATT CTTGCCTGGC TGCTCAGTTT TACCATGGTG GCAGCCAGCC AGGCCCGGGA CGTGACCCTG CAGTGGGATG CAAACACTGA AACGACCGTT GCCGGCTACA AGGTCTACTA CAACGCCGAC TCGGCAGGCC CTCCCTTCAG CGGGACCGGA ACGGTCGGCA AAGTGACCTC CACCACTCTC ACCGGTCTCG ACCCGAGCAA GACCTATTAT TTCGCCGTCA CCGCCTACGA TGCCACCGGC ACTGAAAGCA CCTACTCAAA TATCGTAAGC GCGGCCGAAG CAACTGCCCC CACCGTTTCG ATTACATCCC CCGCCTCGGG CTCGTCCATC TCCGGTACCA CCAGCGTTGC CATCAGCGCC ACTGACAATG TGGGCGTCAC ATCGGTTGAA CTCTATGTGG ACGGCGTCCT CAAAGGAACC GACACCTCAT CTCCGTACAG TGTGAGCCTG AACACCACCC AGCTTTCCGC CGGCACTCAC ACTCTCCAGG CCAAAGCGTA CGACGCCGCC GGGAACGTGG GACAGTCCAC GGCGTTTTCC GTGACAGTGG TAAATGATAC GACGGCTCCT ACGGCTTCCA TTACCTCTCC GACCAGCAGT TCAACCGTGT CGGGAACCGT TACGGTAAAC GTTTCCGCCA CCGACGCCAT GGGCGTGTCA AAGGTGGAGC TCTATGTTAA TGGTTCCCTT TACGCGACCT ACGGTTCGGC TCCCTACTCG ATAACCTGGA ACACCTCGTC GTATGCCAAC GGGTCGTACA CCCTCCAGGC CAAGGCTTAT GACGCGGCGG GCAACGTTGG TCAGTCTTCC TCGGTGGCAG TCACCGTAAG CAATACGGTC GCCGACACCA CGGTTCCGAC CGTTGCCGTG AGCTCCCCCG CCAATGGCGC GACCGTGACC GGTACGGTGA GCATGGCCGC CACGGCGTCG GACAACGTCG GTGTGAGCAA GGTCGAATTC TATGTGAACA ACGTGCTCAA GGGGTCGGAT ACGACGTCTC CCTACAACTA CAGCTGGGAT ACCACCTCCA CGGCCAACGG CAGCTACTCC CTGACCGCCA AGGCTTATGA CGCGGCGGGC AACGTTGGTC AGTCTTCCTC GGTTACGGTC ACCGTAAGCA ATACGGTCGC CGATACCACG GTTCCGACCG TTGCCGTGAG CTCCCCCGCC AATGGTGCAA CCGTGACCGG TACGGTGAGT ATGGCCGCCA CGGCGTCGGA CAACGTCGGT GTGAGCAAGG TCGAATTCTA TGTGAACAAC GTGCTCAAAG GGTCGGATAC GACGTCTCCC TACAACTACA GCTGGGATAC CACCTCCACG GTCAACGGCA GCTACTCCCT GACCGCCAAG GCTTATGACG CGGCGGGCAA TGTGGGGCAG TCGAGCAGCG TAACCGTGAG CGTGAACAAT GTGACGACTC CCCCGTCGGG AAGCAACACG GCCATCTTCG GTAATGCCTT TGGCGCCAAT TTCCCGAATA CCGTGGAAGA CACCTTCCTG AATATCAACG ACGACGTGAA CGCCACCGGC GTAAGCCTCA GCACCTATAC CTGGCCCGCT GCCACGCCGG CCAACGCAGT CGTGATGAAG TGGGACGTTT CCGCACTGCC TGCCAATGCC GAGATCCAGA GCGCCACGCT CTACCTCTAC CTGACTGAAG GCGGTGGTGA CGACGCTTAC GAAATTCCGG TCTCCGCCAT CATCAACAAG AACCCGGTTG TCGCTTCAAG CACTGGCAAT ACCTACGACG GGACCAATGC CTGGACCGCG AGCAGTGTGG CTTACGGCGG GGTGCCCCTG GCCCAGTCCG ATATCGACAC GCCGGTTGAT GCACCTCTGG TTGACAAGAC CGTTGGTTAT AAAGCCTGGA ATATAACCAA TCTGGTCAAG ACCTGGCTGG CGACGCCGGC CGCCAACAGG GGTGTTCTGC TCAACTCCTC CAACAAGGCC GCCGTTGACA GCTACAGGCT CTTCGCTTCC AGCGAGGCGT CCGATACCAA TCTGCGGCCG AAACTGGTGG TAACCTACAA TCTGCCGTCC GATACTGCTG CTCCGACGGT GGCGGTGAGT GCCCCGGCTA ATGGTGCGAC CGTGAGCGGA ACGGTGACCG TCAGCGCAAC AGCCTCTGAC AACGTGGGCG TGACCAAGGT GGAATTCATG GTGAACGGCA CGGTCGCTTC CACGGTGACA ACCGCCCCTT ACAGCTACAG CTGGAACACC ACGACCTCGG CCAACGGCAC CTACACCCTG ACTGCCAAGG CCTATGACGC TGCCGGCAAC ATCGGCCAAT CTACCTCCGT ATCGGTAACG GTGAACAACC AGATCGGCGA CACCACTGCG CCGACCGTAT CGATCACTTC TCCGGCCAAT AACGCGACCG TCAAAGGAAG CATTACCGTC AGCGCCAGTG CATCGGACAA CGTGAAGGTG ACCAAGGTCG AGTTCTACCT GGACAACGTG CTCAAGCGCA CCGACACGAG CTCGCCCTTT ACCTACAGCC TCAACACGAC CTCCGTAAGT GATGGCACCC ATACCCTGAC CGCCAAGGCC TATGATGCGG CAGGCAATAT CGGCGAGACA ACCGTGACGG TGAAGGTCGC CAACGATGCA ACCGCGCCGA CCGTGTCACT GTCGGCCCCG ACCAGCGGCG CAACGGTAAG CGGTGTCGTC TCCGTCAATG CCACCGCAAC GGACAACCTG GCAGTTGCCA AAGTCGAATT CTACGTCAAC AATGTCCTCG CGAGCACGGA TACGACCTCT CCCTACAGCT ACAGCTGGGA TACCTCCACC GTTGCTAACG GCGTCTACAG CCTGACCGCC AAGGCCTATG ATGCGGCCGG CAACTCGAAG GTTTCGACCG CCGTAACCGT AACCGTCAAT AACATCGTGA TCATCAAGGG CGATGTGGAT GGCGACGGAG CGATTACGGC CAATGACGCG CTCATCGTTC TCAAGGCGGT CGCGGATCCC ACCCTGTTGA CCTCGACAGT GCAGAGCATG GGTGATGTGG CGCCGGTCGA TCCGGTTACC TCCAAGCCGG TCGGCAACGG CAAGATTGAC ATCAACGATG TACTGATTCT TCTGCGCCGG GCGGTTGGTC TGACGACCTG GTAA
|
Protein sequence | MKTRTLTTLI LAWLLSFTMV AASQARDVTL QWDANTETTV AGYKVYYNAD SAGPPFSGTG TVGKVTSTTL TGLDPSKTYY FAVTAYDATG TESTYSNIVS AAEATAPTVS ITSPASGSSI SGTTSVAISA TDNVGVTSVE LYVDGVLKGT DTSSPYSVSL NTTQLSAGTH TLQAKAYDAA GNVGQSTAFS VTVVNDTTAP TASITSPTSS STVSGTVTVN VSATDAMGVS KVELYVNGSL YATYGSAPYS ITWNTSSYAN GSYTLQAKAY DAAGNVGQSS SVAVTVSNTV ADTTVPTVAV SSPANGATVT GTVSMAATAS DNVGVSKVEF YVNNVLKGSD TTSPYNYSWD TTSTANGSYS LTAKAYDAAG NVGQSSSVTV TVSNTVADTT VPTVAVSSPA NGATVTGTVS MAATASDNVG VSKVEFYVNN VLKGSDTTSP YNYSWDTTST VNGSYSLTAK AYDAAGNVGQ SSSVTVSVNN VTTPPSGSNT AIFGNAFGAN FPNTVEDTFL NINDDVNATG VSLSTYTWPA ATPANAVVMK WDVSALPANA EIQSATLYLY LTEGGGDDAY EIPVSAIINK NPVVASSTGN TYDGTNAWTA SSVAYGGVPL AQSDIDTPVD APLVDKTVGY KAWNITNLVK TWLATPAANR GVLLNSSNKA AVDSYRLFAS SEASDTNLRP KLVVTYNLPS DTAAPTVAVS APANGATVSG TVTVSATASD NVGVTKVEFM VNGTVASTVT TAPYSYSWNT TTSANGTYTL TAKAYDAAGN IGQSTSVSVT VNNQIGDTTA PTVSITSPAN NATVKGSITV SASASDNVKV TKVEFYLDNV LKRTDTSSPF TYSLNTTSVS DGTHTLTAKA YDAAGNIGET TVTVKVANDA TAPTVSLSAP TSGATVSGVV SVNATATDNL AVAKVEFYVN NVLASTDTTS PYSYSWDTST VANGVYSLTA KAYDAAGNSK VSTAVTVTVN NIVIIKGDVD GDGAITANDA LIVLKAVADP TLLTSTVQSM GDVAPVDPVT SKPVGNGKID INDVLILLRR AVGLTTW
|
| |