Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_1053 |
Symbol | |
ID | 3747034 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 1425092 |
End bp | 1429435 |
Gene Length | 4344 bp |
Protein Length | 1447 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 637773582 |
Product | filamentous haemagglutinin-like protein |
Protein accession | YP_379358 |
Protein GI | 78189020 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain [TIGR02601] autotransporter-associated beta strand repeat |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0559145 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAGGG TATTTAATGT GATCTGGTCA ATTACCAGAG AAAAATGGGT TGTGGTTTCA GAAAGAGTTA AATCAAATGG CTCGGTGCCA AAATCATCAT TAGTGAGCAT TGCTTTTCTT TCGGCATTGC TTGGTGGAGG CAGTGTTGCG CAAGCGGTAG ATGCCAACCA GTTACCAACG GGCGGTGTTA TTGCCGCTGG TAGTGGCTCT ATTGCGGCAA GCGGCAACAG CATGACGATT CAGCAGTCAA GCCAAAAGAT GGTTGCTAAT TGGAGCAGCT TTAACGTTGG TAGCGATGCA AGTGTGCGTT TTCAGCAACC AAATGCTTCA GCCGCAGCAC TGAACCGTAT TGCAGGACAA AGTCCTTCAC AAATTCTTGG TTCACTTTCG GCTAACGGGC GTGTTTTTCT TGTCAACCCA TCAGGTATTG TGTTTGGCAA AAATGCTCGT GTTGATGTTG GTGGATTGGT GGCTTCAACC CTCAATATTT CCGATAACGA TTTTCTTGCA GGGAACTACG CTTTCCGCTC AACGGGATCG GCAGGAACGT TACGCAATGA AGGTGTGATT AACGCAATGC CGAATGGCGT GGTGGCACTT TTAAGTCCAT CAGTTGTTAA TAATGGCACT ATTAATGCAG CAGGTGGCAC GGTAGCACTT GCCGCAGGTA ACGCTATGAC GCTTGATTTT GGCGGTGATG GCTTAATGAC GGTTCGCGTG GACGAAGGTG CCGTGAATGC GTTGGTTGAA AATAATGCGC TCATTAAAGC AGATGGTGGG CTTGTTGTAA TGAGTGCAAA AGCGGCGGAT GAGTTAGCAC TTTCAGCAGT AAACAGCAGC GGTGTGGTGC AAGCAATGAG CGTTGTTGAA AAAAATGGAC GCATTTTGCT TGATGCCGAA GGTGGGCAAA GCACCATTTC GGGCACACTT GATGCCTCGT CAGTTGATGG CAAGGGTGGT CAGGTTGTGG TTACAGGCAA GCAAGTAATG GTTGCCGATG GCGCTCATTT AAACGCCTCA GGTCTCACTG GTGGTGGCGA AGTGCTTGTT GGTGGTAGTT GGCAAGGTAG CGATGCCTCC GTTCGTCAAG CTGTTGGCAC GGTGGTTATG CCCGGAGCTT TGTTACAAGC AAATGCAACA GGCAACGGCA ACGGTGGCAC GGTGGTTGTT TGGTCAGATG TAAATAATCC ACTCTCTGTT ACTCGTGCTT ACGGTACCTT TGAAGCGTAT GGTGGATTGT TAGGCGGTAA TGGTGGACGT ATTGAAACTT CGGGACATTG GCTTGATGTG GCAGGCTCAC GCGGAGGAGC ATCGGCAGTA AATGGCAATG CGGGTGTGTG GCTGTTTGAT CCTTGGAATG TGATTATTGG TCCAGATCCA ACAACGAGTG GAACATCGTT TACTAATCCA TTTAATCCCA CTGGAGATTC AACGATTCTT GCATCGAACA TTAATACTTT GCTCAATGCA GGAACAAGTG TGTCCATTAC TACGGGTACG GGAGGTACAG TTGGGGTAGG AGATATTTCG GTTAATGCTC CTATTTTAAA AACTACAGTA ACAGGTCTTA ATACTTTGAC GTTAAGTTTA ATTGCTGAAG GAAATATTTT TATCAATAAT TCCATTGGTA ATTCTTCGGG TACTCTCAAT CTCAATTTAA CAACGGTAAA TGGTGCAATT AGTGGCACAG GAAATATTAC CGGTAATGGT AATGGAGATA CAATTTTTAC TGTTGGTGCT GGAAGTGGTA CCTATAGCGG AAACCTTGTT GATCGTCGTT TTGTTGAAAA GAAAGGAGTA GGAACCTTGA TTGTGTCTGG TGATAATAAT CATGATGGTG AAACAAGAAT TTCTGCAGGA ACATTGGTGG TTCAAAGCTC GACCGCTTTA GGTAAAACAA CAAATGGCAC TCAAGTGGTT GATGGAGCAA CTTTGCAATT AGAAGCCAAT ATTGCAGCAC AAGAATTACT TTATCTTGCA GGTGATGGGG TTAATTCAAA TGGTGCTTTA AAAAATATTG GGGGAAATCA TGTCTATGGT GGAGATATTA TTTTACTTAA CAATAGTAGG ATAATGTCTG ATGCTAATAC ATTGACCTTA AATGGTTCTG TTAATGGAGC ATATTCTCTT ACCGTAAATA GTGTCGGTAG TACAATCTTT AATGGATTAA TTGGTAATTC AGCTCCTCTT GGTGCATTTA TAGGTACTGC TGGTACGCCA ATTACTTTTA ATGGCAGTTC CATTACAACA GTAGGTGCAA TAAATGCTGC TGGAGTGGTT ACAGCTTCTA ACCCATTAAC TATATCGGCA GGTGCTGGTA ATATCTCATT ATCAAATACG GGCAATAATT TTAACTCGGT TAACATAACA AGCGCAGGCA CTGTCTCATT AGTTGATACT AATGCTTTGG CGCTTACAGG TGTAAATGCA ACCGGAGATG TTTCAATTGC CACAAGAAGT GGTGATTTAA CTATTGATGG TCATCTGTTA ACAACGAGTC CAACATCATC AGCAATGATC CTTAATGCTG AACAAGCACA AATTGCAGGT AATGGTAATG GAGGTAATCT CGTGTTTTCA AGCGGTACCC TTACTGTTGG TTCGGGTGGT ATAGCCACTC TTTATACTGG CAGTGTAGCT GGTAGCACAT CAATTGCTTC AGTTGTTAAT GCAGGTCATT TCCGCTATAA CAGTGATGAA GCAATAAATG GCACGCATTA CACTGATCCA TTAACTGCTG GTTTAAACCT CATTTATCGT GAGCAACCAA CGCTTCTTGT GGCTCCTGCT GCAACACCCA CACCCTATGG AACAGCTCCA TCTTACACAC CATCCTATTC GGGAGCTGTT AATAATGATC CTACTGTTGG CACGGTTGCA GGTACGCCAC AATGGGCATT TGATAATGCA ACAATACCAA CAAAATCCTT ATCTGGTCAA GATGAAGTTG GTACGTATAA CGTAAAATAC GTTGGAGGTT TAACGAGTAC GCTTGGTTAT GGATTTGCTG ACAATGGAGG AAATGGGGAA TTAACTATAG CTCCAAAAGA AATTGTTTTT GGTAATGGTT TAACGGGTGG TGTAAATAAT AAAGTATATG ATGGAACACT TACAGGTACT ATAACTCCAC TTGTGCTTTA TGTTGTTGCT GGTGATAATG TTAGCTTAAA TAGTACTGGT GCCACCGCTA CGTTTTCCAA TAAAAATGTT GGTGTAGGTA AAACTGTTAC CGTTGCCGGA TTAGCGCTTA CGGGGGATGA TGCAGGTAAC TACTCTATTG GTAACCAAAC AACAACAGCA AATATTATCC AAGCCTCATT AACCGTTACT GCTCCTGGTA ATCTTACCAA AGTATATGAC GGTACGGTTA CAGCTATAGG TGTTGCAACA GTAACAGGGC TTGTTTCTGG TGATACAGTT GCAGGAACGG TAGCTATTGC TTATGCCGAT AAAATGGCAG GCTCAAGCAA GGCTGTGAAT CCGTTGAGTG TAATGATTGT AGATGGTTCT GATATGAATA TGACCGGCAA CTATAACATT GCCTACGTTC CGACTGTCAA CAATACCATC ACTCAAGCGT CGTTAACGCT TACATCGCCT GATAATGTTT CAAAGTTTTA TGATGGTTTG ATGAGTGCTC CGGGTGCACC TATGGTTACT GGTTTAGTGC CGAATGATGT GGTAGTTACA CCGGCACCAC TCTCTTATAA TGATCCTGAA GTGGGAAACA ATAAAACCGT TTCACCAAAT CCTGCTGGAT TGGTTATACA CGATGCAAAT GGTGGCGATA TGACTCCAAA CTATGTTATT ACGACAATTC CGCGTAATGA TGGAGTTATT GTCGAAAAAA CCTTCACTCC ATATAAAGAA TGGAATGATA TTGATCCATC AACACCAGAA GTTCCAACAG CCGCACCTGA AGTGAGTGGC AACCGTGATT TGGGGGATGT TGAGCTTGCT GCGGATGATG GAGGCACAAC AGCTACTCGC TCGCTTGCCA TGGTAGCAAT GGATGAAACA GCTATTCAGT CAGATATTGT GGTTACGCTT TTGGAGCCTG CGGCTAAGAA TAAGCAAGGT GTGGTGAAGG TGTTTGTGCC AAAAGAGGTG CTTGCAAAGC CCGCTTTCTT GTTCCCACTG CCTGACGATG TGGCAACTGC AATTAATCAA ACTGCCGTAC AGGAAAGGGT TTTCTTGCAA AATGGTGATG CCTTACCTGG CTGGTTAAGC TATGACCGTG ATAAAAAAAT CTTTACCGCC AAAAGCGCTC CAGCAGGTTC GTTACCGCTG ACGGTGATGG TTCAAGCAGG CAGTATGGCT TGGCAGGTTA TTATTCAGCA GTAA
|
Protein sequence | MNRVFNVIWS ITREKWVVVS ERVKSNGSVP KSSLVSIAFL SALLGGGSVA QAVDANQLPT GGVIAAGSGS IAASGNSMTI QQSSQKMVAN WSSFNVGSDA SVRFQQPNAS AAALNRIAGQ SPSQILGSLS ANGRVFLVNP SGIVFGKNAR VDVGGLVAST LNISDNDFLA GNYAFRSTGS AGTLRNEGVI NAMPNGVVAL LSPSVVNNGT INAAGGTVAL AAGNAMTLDF GGDGLMTVRV DEGAVNALVE NNALIKADGG LVVMSAKAAD ELALSAVNSS GVVQAMSVVE KNGRILLDAE GGQSTISGTL DASSVDGKGG QVVVTGKQVM VADGAHLNAS GLTGGGEVLV GGSWQGSDAS VRQAVGTVVM PGALLQANAT GNGNGGTVVV WSDVNNPLSV TRAYGTFEAY GGLLGGNGGR IETSGHWLDV AGSRGGASAV NGNAGVWLFD PWNVIIGPDP TTSGTSFTNP FNPTGDSTIL ASNINTLLNA GTSVSITTGT GGTVGVGDIS VNAPILKTTV TGLNTLTLSL IAEGNIFINN SIGNSSGTLN LNLTTVNGAI SGTGNITGNG NGDTIFTVGA GSGTYSGNLV DRRFVEKKGV GTLIVSGDNN HDGETRISAG TLVVQSSTAL GKTTNGTQVV DGATLQLEAN IAAQELLYLA GDGVNSNGAL KNIGGNHVYG GDIILLNNSR IMSDANTLTL NGSVNGAYSL TVNSVGSTIF NGLIGNSAPL GAFIGTAGTP ITFNGSSITT VGAINAAGVV TASNPLTISA GAGNISLSNT GNNFNSVNIT SAGTVSLVDT NALALTGVNA TGDVSIATRS GDLTIDGHLL TTSPTSSAMI LNAEQAQIAG NGNGGNLVFS SGTLTVGSGG IATLYTGSVA GSTSIASVVN AGHFRYNSDE AINGTHYTDP LTAGLNLIYR EQPTLLVAPA ATPTPYGTAP SYTPSYSGAV NNDPTVGTVA GTPQWAFDNA TIPTKSLSGQ DEVGTYNVKY VGGLTSTLGY GFADNGGNGE LTIAPKEIVF GNGLTGGVNN KVYDGTLTGT ITPLVLYVVA GDNVSLNSTG ATATFSNKNV GVGKTVTVAG LALTGDDAGN YSIGNQTTTA NIIQASLTVT APGNLTKVYD GTVTAIGVAT VTGLVSGDTV AGTVAIAYAD KMAGSSKAVN PLSVMIVDGS DMNMTGNYNI AYVPTVNNTI TQASLTLTSP DNVSKFYDGL MSAPGAPMVT GLVPNDVVVT PAPLSYNDPE VGNNKTVSPN PAGLVIHDAN GGDMTPNYVI TTIPRNDGVI VEKTFTPYKE WNDIDPSTPE VPTAAPEVSG NRDLGDVELA ADDGGTTATR SLAMVAMDET AIQSDIVVTL LEPAAKNKQG VVKVFVPKEV LAKPAFLFPL PDDVATAINQ TAVQERVFLQ NGDALPGWLS YDRDKKIFTA KSAPAGSLPL TVMVQAGSMA WQVIIQQ
|
| |