Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_1512 |
Symbol | |
ID | 3747145 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 1980792 |
End bp | 1986791 |
Gene Length | 6000 bp |
Protein Length | 1999 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637774051 |
Product | filamentous haemagglutinin-like protein |
Protein accession | YP_379810 |
Protein GI | 78189472 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACTC ATCCTCTTTT TTTTCCATTG CATGGGCGCG ATGTTTTTGT GGTTGCCTTG TGTGTAACGC AGTTGTTGCT CGTTGTGCCA CAAGCTCAAG CACTGCCAAC GGGTGGAGCG GTGGTGGCTG GTTCGGCAAA CGTTACGCTC CCGTCGGCAA CAACCATGCA GATTGAGCAA GCAAGTCAAA AAGCTATTAT TAATTGGCAA TCCTTTGGAG CGGAGCGTGG CGAGCGGGTG CAGATTGTGC AACCTGAGAG TTCATCGGTG CTGCTCAATC GGGTGATTGG CAATAATCCC ACCTCTTTTT TTGGGCAGTT GCAAGCTAAT GGGCAGGTTT TTCTGGTTAA TCCCAATGGC ATTTACTTTG CTCCGACTTC CCAGCTTAAT ACAGGTGGTT TAGTGGCTTC AACCTTATCT CTCAACGATC GTGATTTTCT TGCGGGTAAC TATGCTTTTG TGGCGCAAGG GGCAATGGGA GCCTTGCTGA ATGAGGGAAC TTTGCAAGGT GGATTTGTGG CGCTGCTTGG CAGTAATGTG GAAAATCGAG GGGCGATTGT AACAACGCGA GGCACGGCGG CATTAGCGGC GGGAGAGGCT ATGACCCTTA ACCTTGATGC TTCAGGATTG GTTGCGCTTA CGGTTGATCA AGCCGCGTAC AACGCTCATA TTCGCAATAG TGGCATTCTT GAAGCTGAGG GTGGAACGGT GGTGCTTAAC GCAGGTGCAG CGGAGGATGT GTTGGCGGGC GTAGTGAATA ATAGTGGACG AGTGGTGGCT ACCAGCGTGA GTGAGCGAAA TGGGGCAATT GTTATTGAAG GTGGTTCATT GGTGCAAACG GGTGAGGTTG TTGCGCCAAC CATTAATGTA GCGGTGAATC GCATGGTGGA TGCGGGCTCG TGGCGTGCGG AGCAAGGGAA CATTACCATA CATGCAGCAA CAACCATTGA GCAAACCGCC GCAAGCCACA TAAGTGCATC GGGTAAACAA GGTGGATCGG TACGGCTTGA GGCTGGTAAG CAGCTTTATC TATCGGGTGC AATTGAATCG AATGGCACGG ATGGGCAGAG TGGTAGCGGT GGAACTATTG CAGTAACATC TCCAACCACA ACGATTGCGG GTGCAACACT GAGCGCTAAT GGTGGCACCG ATGGCGGCAT GGTGCTGATT GGTGGCGGTT GGCAGGGAAG CGAGCCAAAC CTACCAAATG CCGCAACAAC AACGGTTACT GCAAGTAGTT CCATCAGCGC TAATGCAAGC ACGGTTGGCA ATGGCGGTAC GGTGGTTGTG TGGTCGGAGC AAGCAACAAC CTTTGCGGGA ACTATTGCTG CCAATGGTGG CAGTGAGTCG GGTAATGGTG GGGCAGTTGA GGTGTCGGGG CATGAGCAAC TTGCTATGAG TGGCACGGTG TCAACATCAG CTCATCATGG CGAGGCGGGA TTCCTATTGC TTGATCCGCG TAACATCACG ATTGAACAAC CTCTTTTACT ATCGCAATTT CAATTCCAAT TAATCTCGCT TCTTGATCCA AACGCAACAG CAGGCAACCA GCATGGTTCA GGCGCTATTC TTGAGTTGCT GAATGGTAAT CTTTTGGTTA CAAGTCCGCT TGATGATGTT GGGGGAAGCG ATGCGGGTGC GTTGCGCTTG TACCGCCCTG ATGGCACGTT ACTTTCCACG CTAACAGGCT CAGCAACGGG CGATTTAAGC GGGGGCACCA TTACTCCGCT GCAAGGTAAC AGCAATGCTG TTTTTTTAGC TTCCAATTGG TCAAATGGCA CCGCAGCAAA GGCAGGTGCT GTAACGTGGA TTGATGGCAC GAATGGTGTA AGTGGCACTA TTTCGGAGGG CAACAGCTTT GTTGGCACCC ATGCAAACGA TGGGATGGAT GCCGAAGTAA TTGCACTGAG CAACGGAAAT TATGTAGCGC ATTTGCCAAG TTGGCAACAT GATGAAGTGC TCAACGCAGG TGCTGTGGCT TTTGGGAATG GTACAAGTGG TTCAGCGGGC ACTATCAGCG AAGCCAACAG CCTTGTTGGT ACAAAAGCCA ACGATAGTGA TTCAGCCAAA GTAGTTGCAT TAACAAATGG TAATTATGTG GTTGCTTCGC CGCTGTGGGA TAATGGTTCA ACAACCAACG TGGGCGCGGT TACATGGGGA AACGGTCAAA CGGGAAAGGT AGGTGCTATC AGTGGTAGCA ATAGTCTTAT AGGCACAAAA AGTGGCGATA ATGTGGGGTT GCAGGTAACC TCGCTTGCCA ATGGCAACTA TGTTATTGGC TCGCCAAATT GGGATAATGG ATCAACCGCT AATGTGGGAG CGGTTACTTG GGCTGATGGA AATTTGTCAA TACACGGTGC GCTTAGTGCA ACAAATAGCC TTGTTGGGGC TAAAAGTGGC GATTACGTTG GCTCATCGGT AACGGCACTT ACTAACAGCA ATTACGTTGT GGTTTCGCAA TCGTGGAGTA GCGATACGGC AACGGACGTA GGTGCAGTAA CGTTGGGTCA TGGTGATGCA GGAACTACTG GAGTAGTAAC GGCTGACAAT AGCCTTGTGG GTAGTTCAAC GGGTGATGGC GAAAAGCTTT CAGCAACGGC TTTAGCCAAT GGCAATTTTG TAGTGGTGGC ACCAAAATGG GATGGCGATG CAACAAACAT GGATGTTGGT GCCGTTGTGT TGGGCAATGG CGTAACGGGT AGCGTAGGGC AAATTTCTGC CACCAACGCT CTTGTTGGCA CTACGGCTAA CGATTTAGAG AGCGCAACAG TAACCCCACT TACCAATGGC AACTATGTGG TAGCAGCCAC CAAGTGGGAT AATGGGGTGG TAGCTGATGC GGGTGCGGTT ATTGTTGGCA GTGGTACAAC GAGCATAACG GGTACCATCA GCGCCGCAAA TAGCCTTGTT GGAAGTGTGA GTAACGATTT ATTATCAGCC ACCATCACAC CTTTAACCAA TGGCAATTAT GTGGTTGCTG CCTCAAAATG GGATAATGGT GCGGTTCTTG ATGCAGGTGC GGTGGCATGG GGCAACGGTC AAGCTGGCAC GGTTGGTTCC ATAAGCGAAT CCAACAGCCT TGTTGGTAAT AAGAAAGATG ATTTTAGTGG TCTCACTATT ACCGCTTTAC ACAACGGTAA TTACGTGGTT TCGGCTTCTT TATGGGATAA TGGATCCATA ACGAACGTTG GAGCGGTAAC GTGGGGCAAC GGTCAAACAG GTACGGTTGG GACTATCAAT AGCACTAACA GCTTGATAGG TGCAAAATCG GGCGATAAAG TTGGTGCCGT TACGGTTGCG CTATCGGATG GCAATTATGC CACAGCTTCG GGGGAATGCG ATAATGGCTC TCTTGCCAAT GCAGGCGCTG TAACGTTTGG AAATGGCGGG GGTGGAACGG TTGGCGTTGT TTCGTCGGCG AATAGTGTGA TGGGTAGTGA AAAGGATGGC AAAATAGGTT CTGGCGGTTT AACTCCACTT CGTGTTGGAA GCGTGTCGGG TGGCGTTGTG GTAAGTTCGC CACTTGCCCA AGCAAGCAAT GGAAACGTTA CGTTGTTCGC TCCTTCAACT GCCAATGAAG CAGGGATGCT GTCAGCCGAT TACACGTATG CCGCAGATGG TAGCAGCAAT GTTACCGTAA CTCCCACCCA GCTTGCTACC TTGCTTAATA ACGGTACAAG TGTGCGGTTA CAAGCAAGCA ACACCATAAC GCTCAATACT CTTCTTACCG CTAACGCTTC ATCTTCCACA ACGCTTGAGT TGCATGCGGG TAAGAGCATT TTGTTGAACA ACTCCATTGT TACAGGCAAT GGCAACCTCA CCCTTATAGC AAACGATAGT GCGGAACATG GCGTGGACAA TACCTTGCGT GAATCGGGGG CGGCGGTTAT TTCAATGGCT TCGGGCACAG CTATCAATGC TGGTACGGGG CAGGTTGTTG TAGAGTTGCG GGATGGGGGA GAACGCGCCA ACAATGCTTC GGGGGATATT ACCCTTGGAA GCGTAACCGC AGGCACTATC TCTGTTGCCA ACAACGGTAG CTCAAATAGC TCAGGCGTTG TGCTTGCAGG TGCGGCGCTA ACAGCAAACG AGAGCAACGG ATCCACCATT GTGCTTTCGG GGCAACATTT TACCAACAGC GCCAACGCAA CACTTAATAC CGAACCTGAA GCGCGGTGGC TTATTTATTC ATCATCCCCA GAAGCCACTC AAAAAGGAGG ACTTACCTCA TCATTTCGCT CTTACAACGT GCTACCCGCA ACCTATGCAG CCGCAGCAGT TACCGAACAA GGTCATGGCT TTTTATATGC TTCAGCACCG AGCCAGCTTG GAGTCAATAT CACCCTTAAC AATGGCAGTG CAAGCAGTGT GTATGGCAAT GAGCCTAATG CCACATTGGG TTACTCGCTG CATGGTTTTG CCGATAATGA AGAGAGTGCC AACACTATTG GACTTGAGGG TTCTATGCAA GTGAGCGGTA TGCCGAATAC TACCTCGTCC GTCGGCACCT ATAACGTAGC TTATGCGGGC GGTTTAACAA GTAGCAAAGG TTTTACCTTT ACAGCAGGCA CGCCACTTGC GCTTACGGTT GAACCACAAC CAATTACGGT AAACCCTGAT GATCAAGAAA AAACCTATGA TGATACCGAT CCCGATTTAA CATGGCAGGT GGAGGCTCAA GGTGTTGGGA GAGGCTTGTT AGTTGGTGAT GTTTTTAGTG GTGAACTTGG TCGTGAAGCG GGTGAAGATG TTGGCTCTTA CGCCATTACA CTCAATACGC TCCATAACGA TAACTATGCG ATATCGTTTA TTCCGGGCAC CTTTACCATT ACGCAACGTC CACTGACACT CAGTGCAACA TCAACCCAAA AAGTGTATGG TGAAGCCGAT CCAACCTTAG CCGTAACCAT CACTTCAGGA TCGCTTGCAA GCACCTTGCG GCAGGATGCT TTAAGCGATG TTGTGGGCAC GCTAAACCGT GAAGTGGGTA ATAACGTTGG GAGTTACGAT GTGGTGCTTG GCAGTGGCAG TCGCTCCTCA AATTACAACA TCACCTTTGC TGCCGACAAC AACGCCTTTA CTATTGCGCA GCGTCCACTC ACCGTTACAG CTTCTCCGCT TACGAAAACC TATGGTGATG CCGACGCAGC ACTTGCATGG CAGGCAGAAG CTGCAAGTAG CGGACGAGGA TTGCTTGCGA ACGATACGCT GCATGGTGAG CTTGCTCGTG AAGCGGGTGA AGATGTGGGG AACTATGCAA TTCTGCAACA CACGCTTGGC AACAACAACT ACGCTATTAG CTATCAAGGC AGCAATCTCT CCATCACCCA ACGTTCGCTT ACTCTTAGCG CAACACCTAC GCAAAAAGTA TATGGTGAAG CCGATCCAAC CTTAGCCGTA ACTATCACCT CAGGATCGCT TGCAAGCAGC AGTGTGCAAG ATGCGTTGGG TGATGTAACG GGCTTACTAA GCCGTCAAGT TGGTAACAAT GTGGGAAGCT ACGATCTGCA ACTTGGCAGT GGTAGCCGTG CCTCCAATTA CAACATCACC TTTACAGCCA ACAACAACGC ATTTACCATA ATGCCACGCC CTGTGGTGGT TGCGGCAAAC AATTTCAGCA AAGTGTATGG CGATGCCGAC CCCGCTCTCA CATGGCAAGC GGAATCCTCT GACCCTGCAC TTGCGGCTGA AAATCTTGCG TTGTTGCGCA GCCTTGAACT ATTCAATAAC ACGAATAACC TCTCAGGTAT AACGGGCGCT CCCTCCTCAA ACGAGTCGCT TCTCTCCTCA ACAGAAAATA ATGCTCAGAG TTCCAGCGAT ACCGCTTCAA CTTCTTCGAC AAATAATGAG GATGAGGAAA TGGTGGGAAT AAGAAGCCCA ATGGGAAACA TCTATATAAG TTTCCCATTG GCTGAATACG ACTTCAAGGT GGAGTGGTGT CAAGGGTCAC ATATCTTACA TGGCACAAAA CCAGCTTTTA TTGCTTCTGA ACGACTATGA
|
Protein sequence | MKTHPLFFPL HGRDVFVVAL CVTQLLLVVP QAQALPTGGA VVAGSANVTL PSATTMQIEQ ASQKAIINWQ SFGAERGERV QIVQPESSSV LLNRVIGNNP TSFFGQLQAN GQVFLVNPNG IYFAPTSQLN TGGLVASTLS LNDRDFLAGN YAFVAQGAMG ALLNEGTLQG GFVALLGSNV ENRGAIVTTR GTAALAAGEA MTLNLDASGL VALTVDQAAY NAHIRNSGIL EAEGGTVVLN AGAAEDVLAG VVNNSGRVVA TSVSERNGAI VIEGGSLVQT GEVVAPTINV AVNRMVDAGS WRAEQGNITI HAATTIEQTA ASHISASGKQ GGSVRLEAGK QLYLSGAIES NGTDGQSGSG GTIAVTSPTT TIAGATLSAN GGTDGGMVLI GGGWQGSEPN LPNAATTTVT ASSSISANAS TVGNGGTVVV WSEQATTFAG TIAANGGSES GNGGAVEVSG HEQLAMSGTV STSAHHGEAG FLLLDPRNIT IEQPLLLSQF QFQLISLLDP NATAGNQHGS GAILELLNGN LLVTSPLDDV GGSDAGALRL YRPDGTLLST LTGSATGDLS GGTITPLQGN SNAVFLASNW SNGTAAKAGA VTWIDGTNGV SGTISEGNSF VGTHANDGMD AEVIALSNGN YVAHLPSWQH DEVLNAGAVA FGNGTSGSAG TISEANSLVG TKANDSDSAK VVALTNGNYV VASPLWDNGS TTNVGAVTWG NGQTGKVGAI SGSNSLIGTK SGDNVGLQVT SLANGNYVIG SPNWDNGSTA NVGAVTWADG NLSIHGALSA TNSLVGAKSG DYVGSSVTAL TNSNYVVVSQ SWSSDTATDV GAVTLGHGDA GTTGVVTADN SLVGSSTGDG EKLSATALAN GNFVVVAPKW DGDATNMDVG AVVLGNGVTG SVGQISATNA LVGTTANDLE SATVTPLTNG NYVVAATKWD NGVVADAGAV IVGSGTTSIT GTISAANSLV GSVSNDLLSA TITPLTNGNY VVAASKWDNG AVLDAGAVAW GNGQAGTVGS ISESNSLVGN KKDDFSGLTI TALHNGNYVV SASLWDNGSI TNVGAVTWGN GQTGTVGTIN STNSLIGAKS GDKVGAVTVA LSDGNYATAS GECDNGSLAN AGAVTFGNGG GGTVGVVSSA NSVMGSEKDG KIGSGGLTPL RVGSVSGGVV VSSPLAQASN GNVTLFAPST ANEAGMLSAD YTYAADGSSN VTVTPTQLAT LLNNGTSVRL QASNTITLNT LLTANASSST TLELHAGKSI LLNNSIVTGN GNLTLIANDS AEHGVDNTLR ESGAAVISMA SGTAINAGTG QVVVELRDGG ERANNASGDI TLGSVTAGTI SVANNGSSNS SGVVLAGAAL TANESNGSTI VLSGQHFTNS ANATLNTEPE ARWLIYSSSP EATQKGGLTS SFRSYNVLPA TYAAAAVTEQ GHGFLYASAP SQLGVNITLN NGSASSVYGN EPNATLGYSL HGFADNEESA NTIGLEGSMQ VSGMPNTTSS VGTYNVAYAG GLTSSKGFTF TAGTPLALTV EPQPITVNPD DQEKTYDDTD PDLTWQVEAQ GVGRGLLVGD VFSGELGREA GEDVGSYAIT LNTLHNDNYA ISFIPGTFTI TQRPLTLSAT STQKVYGEAD PTLAVTITSG SLASTLRQDA LSDVVGTLNR EVGNNVGSYD VVLGSGSRSS NYNITFAADN NAFTIAQRPL TVTASPLTKT YGDADAALAW QAEAASSGRG LLANDTLHGE LAREAGEDVG NYAILQHTLG NNNYAISYQG SNLSITQRSL TLSATPTQKV YGEADPTLAV TITSGSLASS SVQDALGDVT GLLSRQVGNN VGSYDLQLGS GSRASNYNIT FTANNNAFTI MPRPVVVAAN NFSKVYGDAD PALTWQAESS DPALAAENLA LLRSLELFNN TNNLSGITGA PSSNESLLSS TENNAQSSSD TASTSSTNNE DEEMVGIRSP MGNIYISFPL AEYDFKVEWC QGSHILHGTK PAFIASERL
|
| |