Gene Information |
Locus tag | Cag_1055 |
Symbol | |
ID | 3747037 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 1431759 |
End bp | 1436114 |
Gene Length | 4356 bp |
Protein Length | 1451 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 637773585 |
Product | filamentous haemagglutinin-like protein |
Protein accession | YP_379360 |
Protein GI | 78189022 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain; [TIGR02601] autotransporter-associated beta strand repeat |
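
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1431759, 1436114)
print(length, protein_length_aa(length))
```

Under these assumptions the results match the Gene Length (4356 bp) and Protein Length (1451 aa) fields.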
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0430168 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAGGA TATTTAACGT GATCTGGTCG GTTACCAGAG AAAAGTGGGT TGTGGTTTCA GAAAAAGTTA AATCAAATGG TTCCGTACCA AAATCATCAT TAGTAAGCAT TGCTTTTCTT TCGGCATTGC TTGGTGGTGG CAGTGTAGCT CAAGCCGTTG AGCCCGGGCA GTTACCAACG GGCGGCGTTA TTACAGCAGG TAGTGGTTCC ATTGCTACAA ACGGCAACAG CATGACTATT CAGCAGTCAA GCCAAAAAAT GGTGGCTAAT TGGAACAACT TTAATGTTGG CAGCGATGCA AGTGTGCGCT TTCAGCAGCC AAACGCTTCT GCCGCTGCAC TCAACCGTAT TGCCGGGCAA AATCCGTCAC AAATTCTTGG CTCACTTTCT GCTAACGGAC GCGTTTTTCT TATCAATCCA TCAGGTATTG TATTTGGACA GAATGCTCGT GTTGATGTCG GTGGATTGGT TGCTTCAACA CTTGATATTT CCGATTACGA TTTTCTTGCT GGTAACTTCG CTTTTCGTTC AACGGGATCG GCTGGCACCT TGCGCAATGA AGGATTAATT AACGCTATGC CGGGTGGCGT GGTGGCACTC TTAAGCCCTT CAGTTATCAA TAACGGCACC ATTACTGCGG TGGGCGGCAG TGTAGCGCTT GCAGCGGGAA ACCAAATGAC GCTTGACTTT GGCGGCGATG GCTTAATGAC TGTTCGAGTT GATGACGGTG CGGTCAATGC GTTTGTGGAG AACAATTCGC TTATTAAAGC TGATGGCGGA TTAGTTGTAA TGAGCGCAAA AGCCGCTAAC AATCTTGCTT TTTCTGCAGT TAACAACAAC GGCGTCGTGC AAGCAATGAG CGTTGTTGAA AAAAATGGAC GCATTTTGCT TGATGCCGAA GGTGGGCAAA GCACGGTTTC TGGTACGCTT AATGCTTCAT CAGTTGATGG TAAAGGTGGT CAGGTTGTGG TTACAGGCAA GCAGGTAATG ATTGCCGATG GTGCTCACCT TAATGCATCA GGTCTTACGG GTGGTGGCGA CGTGTTGGTT GGTGGTAGTT GGCAAGGTAG CGATGCCTCC GTTCGTCAAG CCGTTGGCAC GGTGGTAATG CCTAATACGC TTTTGCAAGC TAATGCAATT AGTAATGGCA ATGGTGGTAC AGTGGTTGTA TGGTCGGATG TTAACAATCC GCTTTCAGTT ACTCGCGCTT ACGGCACGTT TGAAGCCTTT GGTGGAACAA ATGGCGGCAA TGGTGGACGT ATTGAAACTT CTGGTCATTG GCTTGATGTT GCAGGTTCAC GCGGTGGCGC TTCGGCGGTA AATGGCAATG CGGGTGTGTG GCTGCTTGAT CCGTATAACG TAACGATTTC TTCATCTAAT GCTAATGGTT CTTGGGGTGG TGTTTTTCCC AATGCTATTT GGACTGCAAG CGGAGATAAC TCGAATTTAC TTGCTTCTGA TATTACAACC CGACTTAATG CTGGCACGAG TGTTACGGTT CAAACAGGCA CGGCAGGAAG TCAGGCTGGT GATATTACGG TTGATGGAGC TATCAACATG ACCAATGATA GTGGGGAGGT GTCATTGCAA TTAGATGCAG CAGGAAGTAT TGCTATTAAC AATAATATCA CCAACTCTAC AGGTACACTT CATCTTGTGT TTAACAGTGG AACAGGTGCT ATTAGTGGAA CAGGTGCCTT AGGTAGTGGT CAGGGAAGAA CTCTTTTTAA TGTTGGTGCT TCTACTGGTA CTTTTTCTGG AATAATTAGT GGTGCTAGTC GTACTGTTAC AAAGCAAGGA 
GCTGGCACGC TAATATTTTC TGGAGCTAAT ACTTATGGTG GTTTAACTTC TATTGAGGCT GGAGTATTAA GAGTAGCAAA TGCTCAAGGA TTAGGTGATG TAACAAATGG TACTCAAGTT TCTAATAATG GAGCATTGGA ACTTTCTGGG GGAATAGTAA TAACCGGAGA TGAGGTTCTT CGTTTAGTAG GTACAGGGGT TAGCAATAGT GGCGCATTAC ATAGTATAGG TAATAATAGT TTTGGAGGAC ATATTATTCT TACCGGAAAT AGCACTATTA CTTCCGATAC CAATGGAACG TTAATCTTAG GCAATGCCAG TCAAGGTATT TACGGTGCTT ACGGATTAAC TCTTTCAGGT GGCGGCAGTG TTGTATTTAA TGGGGCTATT GGCGCTACGA TACCACTCGC CTCATTTCAT GGATTAACAG GAACGTCTAT TGAGCTTAAT GGTGGTTCAA TTACAACGAC CGGCGTAATT AGTGCTCTTG GTCAAGTTAA AGCTACTAAT CCATTAACGT TATCGTCTGG TATTAGTGAT ATTTCGTTAT CAAATGAAAC TAATGACTTT ACAACGGTAA CGGTAACAAA TGCTGGTGCA GTATCGCTCA TTGATGATAC TGCATTGACG TTAGCTGGTG TTAATGCAAG TGGTGATGTG AATATTGCAA CCCATACTGG TAATTTGACG GTTACGGGTA ATGTTGCAAC AACAAGTGCA ACACCAACCG CGTTAACCTT AAATGCTGAT CAAAGCAAAG ATGCTGGTAA TGGCAATGGA GAAAATCTCA TTCTTTCAAG TGGTACTCTT ACTGTTGGTT CGGGTGGTAT TGCTAAACTT TATACTGGCA GTGTAGCTGG TAGCACATCA ATTGCTTCAG TTGTTAATGC AGGTCATTTC CGCTACAATA GTGATGAAGC GGTACAACAT TACACTGATC CATTAACTGC TGGTTTAAAC CTCATTTATC GTGAGCAACC AACGCTTTCA GTTATGTTTG CTCCTGTAAC TACGACGTAT GGTACAACTC CAACATTTGC GATAAGTTCG TATAGCGGTT ATATAAATGG AGATACTTCC CCAGGAATTG TTACTGGTAC GCCAACATGG TTGGTAGATG GAACGCCTTC TTTTGCAGGA TATTATACTG CTGGTACTCA CAATGTTTCA TACAATAATG GACTTATCAG TAGTCTTGGT TATGGTTTTG TTGATAATGC AATTAGTTTT AATGACTTAG TTGTTAATCC ACTGGTGTTA GCAGCGACAT CTTTAACGGG TTTAACGGCA TCAGATAAAA TATACGATGG TCAAATAACT GCTACTATAA GTAATTATGG TACGCTGACA GGAATACTGA CGGGAGATCG TGTTGCATTA AATAGTGCTG GATCAAGTGC AGCTTTTGCA GATAAAAACG TCGGTACCGG TAAGACGGTA ACGGTAAGTG GTTTAACGTT GTCTGGACTT GATAATGGTA ACTACCGTAT AGTTCCACAA ACAACGACAG CATCTATTAC CCAAAAATCG CTAAATGTTA CAGCTCCAAG CAACGTAACC AAAGTGTATG ATGGCACGGT AGCGGCTCCA GGTGTTGCTA CCGTCACTGG TCTTGCCATC GGTGATGTTG TAGCGGGAAC AGCAACTATT GAATATGCCG ATAAAATGGC TGGCAGTAAT AAGGTTGTTA ATCCGTTAAG TGTAACGATT CTCGATGGGT TCGATATGAT TATGACTAAC AACTATGCAA TTACTTATGT TGGCGATCAT GGTACTATTA CCCAAGCACC ATTAACGCTT ACAGCGCCCG 
ACAACGTTAC GAAGTATTAT GATGGATTAC TTACTGTTCC AGGTACTCCA AGTGTTAACG GTTTAGTCCC TAACGATGTG GTGGTTATAC CGGCATCTCT GCTTTATACT GATCCTGAAG TGGGAATAGG TAAAACTGTT AATCCTGATT CAGCAGGCTT GGTGATTCAT GATGCTATAG GTAATAATAT GACTCCAAAC TATGCCATTA CTGATATAGC AAGCCACACT GGTATTATTG TTGAAAAAAC CTTTACACCA TTTAAAAAAT GGAATGATGC TGATCCTTCG GTGCCCGAAA TACCCACTAA TGCCCCTGAG GTAACCGGTT CTCGCGATTT AGCGGGAAGC GATTTTGAGC CGGCAACCGA TAGTGGAGTA ACAGCTACTC GCTCACTCAC AATGGCTACC ATGGATGAAA GTGCAGTGCA ATCTGATATT GTGGTGAAGC TTGCGGAGCC TGCATCTAAA AATAAGCAAG GTGTGGTTAA GGTTTTTGTA CCAAAAGAGG TGTTTGCAAA GCCTGCTTTC TTGTTCCCAT TACCTGAGGA GGTAGCTGTT GAGATAAATA AAACTAACGT GCAGGAGAAG GTTTTCATGC AAAATGGTGA TGCGCTGCCG GGTTGGTTAA GTTATGACTA TGAGAAAAAA ATCTTTACAG CAACAAGCGC TCCTGCTGGT TCGTTACCGC TTACGATTAT GGTTCAATCG GGGACGATGG CTTGGCAAGT GATTATCCAA CAGTAG
|
Protein sequence | MNRIFNVIWS VTREKWVVVS EKVKSNGSVP KSSLVSIAFL SALLGGGSVA QAVEPGQLPT GGVITAGSGS IATNGNSMTI QQSSQKMVAN WNNFNVGSDA SVRFQQPNAS AAALNRIAGQ NPSQILGSLS ANGRVFLINP SGIVFGQNAR VDVGGLVAST LDISDYDFLA GNFAFRSTGS AGTLRNEGLI NAMPGGVVAL LSPSVINNGT ITAVGGSVAL AAGNQMTLDF GGDGLMTVRV DDGAVNAFVE NNSLIKADGG LVVMSAKAAN NLAFSAVNNN GVVQAMSVVE KNGRILLDAE GGQSTVSGTL NASSVDGKGG QVVVTGKQVM IADGAHLNAS GLTGGGDVLV GGSWQGSDAS VRQAVGTVVM PNTLLQANAI SNGNGGTVVV WSDVNNPLSV TRAYGTFEAF GGTNGGNGGR IETSGHWLDV AGSRGGASAV NGNAGVWLLD PYNVTISSSN ANGSWGGVFP NAIWTASGDN SNLLASDITT RLNAGTSVTV QTGTAGSQAG DITVDGAINM TNDSGEVSLQ LDAAGSIAIN NNITNSTGTL HLVFNSGTGA ISGTGALGSG QGRTLFNVGA STGTFSGIIS GASRTVTKQG AGTLIFSGAN TYGGLTSIEA GVLRVANAQG LGDVTNGTQV SNNGALELSG GIVITGDEVL RLVGTGVSNS GALHSIGNNS FGGHIILTGN STITSDTNGT LILGNASQGI YGAYGLTLSG GGSVVFNGAI GATIPLASFH GLTGTSIELN GGSITTTGVI SALGQVKATN PLTLSSGISD ISLSNETNDF TTVTVTNAGA VSLIDDTALT LAGVNASGDV NIATHTGNLT VTGNVATTSA TPTALTLNAD QSKDAGNGNG ENLILSSGTL TVGSGGIAKL YTGSVAGSTS IASVVNAGHF RYNSDEAVQH YTDPLTAGLN LIYREQPTLS VMFAPVTTTY GTTPTFAISS YSGYINGDTS PGIVTGTPTW LVDGTPSFAG YYTAGTHNVS YNNGLISSLG YGFVDNAISF NDLVVNPLVL AATSLTGLTA SDKIYDGQIT ATISNYGTLT GILTGDRVAL NSAGSSAAFA DKNVGTGKTV TVSGLTLSGL DNGNYRIVPQ TTTASITQKS LNVTAPSNVT KVYDGTVAAP GVATVTGLAI GDVVAGTATI EYADKMAGSN KVVNPLSVTI LDGFDMIMTN NYAITYVGDH GTITQAPLTL TAPDNVTKYY DGLLTVPGTP SVNGLVPNDV VVIPASLLYT DPEVGIGKTV NPDSAGLVIH DAIGNNMTPN YAITDIASHT GIIVEKTFTP FKKWNDADPS VPEIPTNAPE VTGSRDLAGS DFEPATDSGV TATRSLTMAT MDESAVQSDI VVKLAEPASK NKQGVVKVFV PKEVFAKPAF LFPLPEEVAV EINKTNVQEK VFMQNGDALP GWLSYDYEKK IFTATSAPAG SLPLTIMVQS GTMAWQVIIQ Q