Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Lcho_3794 |
Symbol | |
ID | 6160721 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Leptothrix cholodnii SP-6 |
Kingdom | Bacteria |
Replicon accession | NC_010524 |
Strand | - |
Start bp | 4252992 |
End bp | 4258913 |
Gene Length | 5922 bp |
Protein Length | 1973 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641666567 |
Product | fibronectin type III domain-containing protein |
Protein accession | YP_001792813 |
Protein GI | 171060464 |
COG category | [S] Function unknown |
COG ID | [COG4733] Phage-related protein, tail component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAAAAA GCACCCGATT CATCCGCACG CCGATCGCGG CGGCCACCAG TGTCCTGGCG GTCCTGCTCG GCACCGGCTT GACCGCCCAG GCCGGCGTCG GCTTTGGCGA AAGCCAGGAT TTCTCCGGCA ATCCGGTCTC GAACATCCGT TCCCATTTTG CGCACAGCCC GCGTGGCGAC CGCGAGAGCG CCAACCCGAC GGCGGCCGAC GAAGCGGCCC GCCTGTTCGG TGCCGGCAAG CTGATGGGCG CCAGCGCGGT CGACACCGGC CGCGCGCTGC GCAAGTTCGT CGATCCGCTG CCGCTGCCCG GCCAGCCGCA GACGATGGCC GACGGCACCG TGCGTTATCT GCCGGTGGCC GCGCCCACCA AGTGGATCAA CCCGCAGACC GGCCAGCCCA CCGGTGACGA CTACTTCGAA GTGGCGGTCA TCGAATACAA GCAGAAGCTG CACTCCGACC TGAAGAACCC GACCACCATC CGCGGCTACG TCCAGCTGTC GACCGCGGCG GTGCCGGGCA AGCAGGTGCC GCTGTTCTAC CCGGACGGCG TCACGCCGAT CCTGGTGCAG GAGACCGACG CGCGCGGTTT CCTGGTCTTC AACACCGACG GCAGCCGCAA GATGGTGCAG GCGCGTGCGG TCGACGATCC GCACTTCCTC GGCCCGGTCA TCCAGGCCCG CCAGGGCGTG CCGACGCGCT TGAAGTTCAT CAACCTGCTG CCCTTCGGCC GTGCCGAACT CGGCGCGCCC GACGCCGACG GCAAGCCGGT GGTGACTGCC CGCAACGGCG ACATCTTCCT GCCGCTGGAC AAGTCGATTG CCGGCTCCGG CCTCGGCCCG GACGGCTTCA CCGAATACAC CCAGAACCGC GCCAACATCC ACCTGCACGG CGGCGACACG CCGTGGATCA GCGACGGCAC GCCGCACCAG TGGATCGCCC CGGCCGAAGA GGCCAACGCC GCCAACGGCC GCGGTCTGGC CGCGCAGAGC ATCGATCCCG AGTTCCTGCC CAGCTTCCTG CGCGGTCCGG GCGCGATCAA CGTGCCCGAC ATGCCCGATC CGGGCCCGGG TGCGATGACG TACTACTTCC CGAACGGCCA GTCGGGCCGC ATGCTGTGGT ACCACGATCA CTCGATCGGC ATGACGCGCC TGAACGTCTA CGCCGGCATG GCCTCGGCCT ACCTGCTGGG CGATGCGGTG CAGGACGCGC TGATCAGCGG CGGCTCGGCC ACCGTCAACG GCAAGAGCGC CACCTTCCAG GCGGTGCTGC CGCCGGCGGC CGACACCATT CCGCTGGTGA TGCAGGACCG CACCTTCGTG CCGGCCGACG TCGCGCTGCA GGACGCGCGC TGGAACACCA GCGCCTGGGG CGCCGAGAGC GACTCGTGGT TCCCGCACGT CTATGAAACG GTCCAGGATC CGAACCAGCT CAACGGCTTC AATGCCGTCG GTCGCTGGCA CTGGGGCCCG TGGTTCTGGC CGGTGTTCCC GGCGCTGTAC GACCTGCCCT CGGGCGAATA CGGCGACGTC ACCGTCACGC CCGAAGCGTG GATGGACACC GCGATGGTCA ACGGCGTGGC CTACCCGACG CTCGACGTCG ATCCGAAGAC CTACCGTTTC CAGATCCTGA ACGCCTCGAA CGACCGTTCG ATGAGCTTCA ACCTGTTCGT CGCCGACGAC GCCCAGCCGT TCACCGACCC GGTCACCGGC GACGTGCGCC TGACCGAAGT GAAGATGGTC GACGCGGTCA TTCCTGCCGA CCTCTGCAGC GGCGACCAGA CGCGTGCGGT GCAGCCCGAC GGCAGCATCT GCACGCCCGC CACCTGGCCG ACCGATGGCC GCGCCGGTGG CGTGCCGTCG CCGGCCTCGC AGGGCCCGAC GCTCTACCAG ATCGGCAGTG AAGGCGGCCT GCTGCCGCAG GTGGCGGTGA TCGACCCGGT GCCGGTCAAC TACAACTACG ACCGCGGTCG CATCACGGTG CTCAATGTGC AGACCCCGGC GCTGCTGCTG GGCAATGCCG AACGCGCCGA CGTGGTGGTC GACTTCTCGC AATACGCGGG CAAGACCCTG ATCGTCTACA GCGATTCGCC GGCCCCGATG CCGGCCGGCG ACCCGCGCAA CGACTACTTC ACCAACGTGG GTGACCAGTC GACCGAGGGC GGTGCCGAGA ACACCAAGCC CGGCTACGGC CCGAACACCC GCACCTTCAT GCGCATCAAG GTGCGTGCGG CCGCCCCGGC GCCCGCGCTC GACGTGGCCG CGCTGAAGGC CGAGATCCCC AAGGCCTACG CGCTGTCGCA GGAAACCCCG GTGGTCGGCC AGAAGGAATA CAACACCGCC TTCGGCACCA GCTGGACCGA CAGCGGCGCC TACGCCGACA TCTACGCCGG CTCGCTCAAG CAGCCGCTGT TCAAGTACAC CCCCGGCACG CCCAACGGCG GCGGGTTCAA CAGCGTCAAG GTCACGCAGA TCGGCTCGGG CTACGTGAGC GCACCGACCG TCACCTTCGC CGACAGCACG GTCGGCAACG AGGTCGGTGC CAAGGCCCAG GCGACGCTGA AGATGAGCGC CATCACCGTC ACCGATCCGG GTGCGGGTTA CGTGTCGGCA CCGATCGTGA GCATCGTCGC CCAGTCCGGC GGCGGCTCCG GCGCCGTGGC CGAAGCGCGT CTGGCGATCG ACAAGATCAC CATCACCAAC GGCGGCGCCG GTTACACCTC GGCTCCGGCG GTGCGCTTCT CGGTGCCGCC CACCGGCGGC GTGCAGGCCA GCGGCACGGC CATCGTCACC AACGGCCGCG TCACCGGCGT GACCCTCGAC AACCCGGGTT CGGGCTATGT GGGCGCACCG ACCGTGAGCT TCCAGGGCGG TGGTGCCACC ACCACGGCAC GTGCCACGGC CACCGGCAAG ATCTCCGACG TCAAGCTGCT GTCGCTCGAC CCGATGAACC CGATCGTCTA CAACGCCGAC GGCAGCATCG CCTCGCTCGG CGCAGCCGGC GGCGGTGGCT ACACCGACAT GAGCCAGGTG CTGATCAACT TCAACGGCGG CGTGGCTCCG GCCGGTGGCC GCGCGGCGAT CGCCAGCGCC TCCGGCAGCC TGTTCGACGT GACGATGGTC AACCACGGCG TCAACTACAC CGCCAACACC ACCATTGCCT TCAGCGGCGG CGGCGGTTCG GGTGCGGCGG CCCAGGTCGA CACGCTCAAC GGCGGCACCG CGACCGGCTC CAACCTGGTC AAGACCAAGG CGATCCACGA GCTGTTCGAG CCGACCTTCG GCCGCATGAA CGCCATCCTG GCGGTGGAAA TCCCGTTCAC CAGCGCGCTG ACGCAGACCA CCATCCCGCT GGCGATGATC GACGCCCCGA CCGAGCGTTT TGCCGACGGC GAGACGCAGA TCTGGAAGAT CACGCACAAC GGCGTGGACA CCCACCCGGT GCACTTCCAC CTGCTCAACG TGCAGCTGAT CAACCGCGTG GGCTGGGACG GCTGGATCGA CCCGCCGGCT CCGAACGAGC TGGGCTGGAA GGAAACCATC CGCATGAACC CGCTCGAGGA CGTGATCGTG GCGGTGCGCG CCAAGCGTCC GCCGCTGCCG GGCTTCGGCG TGCCCAACAG CATCCGGCCG ATGGACCCGT CGCAGCCGAT CGGCTCGCCG TACGGCTTCA CGCAGATCGA CCCGAACACC GGCACGCCGC TGTCGGTGGT CAACGAAGTC ATGAACTACG GCTGGGAATA CGTGTGGCAC TGCCACATCC TCGGCCACGA GGAAAACGAC TTCATGCGTC CGATCGTGTT CGACGCCAAC GAAGCCGTTC CGACCGCGCC GGGCGCGCTG ACCGCATCGG CCAACGGCTC GGGTTCGGGC GTGCTGCTGG GCTGGAGCGA CACCTCCGCC ACCGAGTACC AGTTCCGCGT CCTGCGGGCC ACCGGCGCAG CCGGCACGGT CTTCACGCCG ATCGGCACCG CACTGGCCAA CGGCGGCAGC TACCTCGACA ACACCGCCCA GCCGGGCACC AGCTACCGCT ACCAGGTGGT GGCGGTGGGT GCCAACGGTG AAGCGGCCTC GGGCATCGCC GACATCACCA CGCCGACCGG TGCCCCGGTC GTGCCGGCCG CCGTGCAGGC GACCCAGCTG AGCGCCGACA GCGTGCGGCT GCAGTGGCTC GACCAGTCGA CCGACGAAGC CGGCTTCGCC GTCGAGGTGT CGGTCAACGG CGGCGCCTTC GCGGCACTGA CCACGGTGGG CAGCAGCGCC GCCAACGTCA CGGCCACCGG CATCACCGTC ACGTTCGACA ACGCAGGTGC CGCGGTCGGC AGCACCTACA CCTACCGTGT GGCTGCCGTG AACGCCAGCG GTGCGGCCTC GGCCTACGTC AACTCCAACA CCGTCACGGT GATGGGCCCG CCGGCTGCGC CGAACACGCT GACGGCCATC GTCAACTCGA AGACCCAGGT CGCACTGAAC TGGATCGACG GCTCGACCGA TGAAAGCCAG TTCGTGATCG AGGCGTCGGT CAATGGCGGC GCCTTCGCCG CGGTCAGCAC GCTGGCCACG CCGTCGGCCG GCGCCAGCGG CGGTGCGGTC ACCACCAACG TGGCGGTGGT CAACGGCAAC ACCTACGTGT TCCGTGTCGC TGCGTCCAAC ACCTGGGGCA GCTCGACCTA TGCCACCTCG GCCTCGGTGC CGGTGATGAT CGCCCCCAAC GCGCCGACGG CTCAGACCGT GTCGGTGGCC GGCGCAGCGG TGACGCTGAA CTGGCTGGAC AACGCGACCG ACGAGACCAG CTTCGTGGTC GAAGGCTCGC TCAACGGCGG CGCCTACGCC ACGGTGGCCA CGGTGACGCG AACGGCGGCA CAGGCCACCG ACAGCGCGAC GCCGGTTTCG ACCAGCGTGA CCGCAGCCGC GGGCACCTGG ACCTACCGCG TGCGTGCGAT CGGCGCGGGG GGTGCCTCGG CCAACAGCGA CTGGGCCAAC AGCGTGACGG TGACGGCCAC CGGCCCGGCT GCGCCCACCA CGCTGACGGC GGTGCTGCAG TCGTCGACCC GGGTTCGGCT GAGCTGGGTG GATGCGTCCA CCAACGAGAC CAGCTTCCTG ATCCAGCAGT CGGTCAACGG CGGCGCCTTC ACGCAGATCG GCACCGTGAA CCGCAGCGCG GCGCAGGGCG CAGCCAGCGG TGGCGTGCTC AGCTCGCAAC CGACGGTCGC GGCCGGCAAC ACGTATGTGT TCCGCGTCAT TGCACGCAAC GCCTCGGGTA GCTCGGCTCC GGCCGACGTC ACGATCACCG TTGCCGTGGC ACCGGCGGCC AACCTGGCTG TGGCGGCTGT CAGCGCCACC TCGGCCCGCC TGAGCTGGAC CGACGGCGGC CCGCTGGAGA CCGGCTACCG CGTCGAGCGC AGCACCGACG GCGTCAACTG GACCGTGTTG AGCACCACCG CCGCCAACGC CACCGGCTAC ACCGCCACCG GCCTGACGAC GGGCACGACC TACCAGTTCC GCGTCACCGC GGTGCGTACC TCGGGTGGCG TCACGACCAC GGCCACGCCG GTGGAAGTGA GCTACACGGT GGTGGCACCG CCGGTGCCGA CCGCGCCGAG CGCCCTGGCG GTCAACGCCA CCGCGCAACG CTCGGTCACG CTGGGCTGGG TGGACAACGC CAACAACGAG ACCAGCTACC GCGTCGAAGC CTGCCTCGGC ACCTGCACCG ATGCATCGAC CTGGGTGCTG GCGGCCACGT CCACCGCGAG CGCCACGCAG CAGACGGGCA CCGGTGCCCG CACGCTGACG GTCAGCCGCA TCTCCGCCAA CGGCAACAAC CGCCTGGCCG CTGCCACGAC CTACAGCTTC CGGGTGGTCG CGGTGGGTGC CTCGGGCAAC AGCGGGGTCA GCAACATCGT GACCGCCACG ACGCTGCCCT GA
|
Protein sequence | MRKSTRFIRT PIAAATSVLA VLLGTGLTAQ AGVGFGESQD FSGNPVSNIR SHFAHSPRGD RESANPTAAD EAARLFGAGK LMGASAVDTG RALRKFVDPL PLPGQPQTMA DGTVRYLPVA APTKWINPQT GQPTGDDYFE VAVIEYKQKL HSDLKNPTTI RGYVQLSTAA VPGKQVPLFY PDGVTPILVQ ETDARGFLVF NTDGSRKMVQ ARAVDDPHFL GPVIQARQGV PTRLKFINLL PFGRAELGAP DADGKPVVTA RNGDIFLPLD KSIAGSGLGP DGFTEYTQNR ANIHLHGGDT PWISDGTPHQ WIAPAEEANA ANGRGLAAQS IDPEFLPSFL RGPGAINVPD MPDPGPGAMT YYFPNGQSGR MLWYHDHSIG MTRLNVYAGM ASAYLLGDAV QDALISGGSA TVNGKSATFQ AVLPPAADTI PLVMQDRTFV PADVALQDAR WNTSAWGAES DSWFPHVYET VQDPNQLNGF NAVGRWHWGP WFWPVFPALY DLPSGEYGDV TVTPEAWMDT AMVNGVAYPT LDVDPKTYRF QILNASNDRS MSFNLFVADD AQPFTDPVTG DVRLTEVKMV DAVIPADLCS GDQTRAVQPD GSICTPATWP TDGRAGGVPS PASQGPTLYQ IGSEGGLLPQ VAVIDPVPVN YNYDRGRITV LNVQTPALLL GNAERADVVV DFSQYAGKTL IVYSDSPAPM PAGDPRNDYF TNVGDQSTEG GAENTKPGYG PNTRTFMRIK VRAAAPAPAL DVAALKAEIP KAYALSQETP VVGQKEYNTA FGTSWTDSGA YADIYAGSLK QPLFKYTPGT PNGGGFNSVK VTQIGSGYVS APTVTFADST VGNEVGAKAQ ATLKMSAITV TDPGAGYVSA PIVSIVAQSG GGSGAVAEAR LAIDKITITN GGAGYTSAPA VRFSVPPTGG VQASGTAIVT NGRVTGVTLD NPGSGYVGAP TVSFQGGGAT TTARATATGK ISDVKLLSLD PMNPIVYNAD GSIASLGAAG GGGYTDMSQV LINFNGGVAP AGGRAAIASA SGSLFDVTMV NHGVNYTANT TIAFSGGGGS GAAAQVDTLN GGTATGSNLV KTKAIHELFE PTFGRMNAIL AVEIPFTSAL TQTTIPLAMI DAPTERFADG ETQIWKITHN GVDTHPVHFH LLNVQLINRV GWDGWIDPPA PNELGWKETI RMNPLEDVIV AVRAKRPPLP GFGVPNSIRP MDPSQPIGSP YGFTQIDPNT GTPLSVVNEV MNYGWEYVWH CHILGHEEND FMRPIVFDAN EAVPTAPGAL TASANGSGSG VLLGWSDTSA TEYQFRVLRA TGAAGTVFTP IGTALANGGS YLDNTAQPGT SYRYQVVAVG ANGEAASGIA DITTPTGAPV VPAAVQATQL SADSVRLQWL DQSTDEAGFA VEVSVNGGAF AALTTVGSSA ANVTATGITV TFDNAGAAVG STYTYRVAAV NASGAASAYV NSNTVTVMGP PAAPNTLTAI VNSKTQVALN WIDGSTDESQ FVIEASVNGG AFAAVSTLAT PSAGASGGAV TTNVAVVNGN TYVFRVAASN TWGSSTYATS ASVPVMIAPN APTAQTVSVA GAAVTLNWLD NATDETSFVV EGSLNGGAYA TVATVTRTAA QATDSATPVS TSVTAAAGTW TYRVRAIGAG GASANSDWAN SVTVTATGPA APTTLTAVLQ SSTRVRLSWV DASTNETSFL IQQSVNGGAF TQIGTVNRSA AQGAASGGVL SSQPTVAAGN TYVFRVIARN ASGSSAPADV TITVAVAPAA NLAVAAVSAT SARLSWTDGG PLETGYRVER STDGVNWTVL STTAANATGY TATGLTTGTT YQFRVTAVRT SGGVTTTATP VEVSYTVVAP PVPTAPSALA VNATAQRSVT LGWVDNANNE TSYRVEACLG TCTDASTWVL AATSTASATQ QTGTGARTLT VSRISANGNN RLAAATTYSF RVVAVGASGN SGVSNIVTAT TLP
|
| |