Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC7424_1755 |
Symbol | |
ID | 7110413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 7424 |
Kingdom | Bacteria |
Replicon accession | NC_011729 |
Strand | + |
Start bp | 1954577 |
End bp | 1957474 |
Gene Length | 2898 bp |
Protein Length | 965 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 643480017 |
Product | filamentous hemagglutinin family outer membrane protein |
Protein accession | YP_002377058 |
Protein GI | 218438729 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3210] Large exoproteins involved in heme utilization or adhesion |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 0.655087 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGACTT TATCCCGATT GGCTATCGCT ACAAGCTGCT GTCTTGTGAT AGGAGTATGT TCTCCAGACA CAGCCCAGAG CCAAATCGTA CCAGACACGA CCTTATCTAA TCCCTCCCTG GTCACCAATA CGAGTAATAC TCAAGAAATT ACAGGAGGAA CAATAGTCGA TAATCATCTT TTTCATAGCT TTGAACAATT TTCTGTTCCT ACAAACACTG AAGCTTATTT TAATAATAGT TTAAATATTA TTAATATTTT TAGTCGCATT ACGGGCAATT CTATTTCTAT AATTGATGGA TTAATTAAAG CCAATGGGAC AGCGAATTTG TTTTTGCTCA ATCCCAATGG AATTCTTTTC GGCCCTAATG CTTCCTTAAA TATTGGAGGT GCATTTGTGG CTAGCACAGC TAATAGTATT CAATGGGCTA ATAATCAAGA ATTTAGTGCA GTTAATCCCC AAGATGTCTC CCCTTTATTA ACGATGAACG TACCAATGGG GCTACAATAC GGAGCTAATC CAGGCAAAAT AGTCGTTCAA GGAACGGGAA ATAATTTAGG ACTCAGTGAG AATTTTGAAA TTATTCGGGA TTTTCGCCCC TCTGGCCTAG AAGTTCAGCC GGGTCAAAGC TTAATTTTAG CGGGAGGAGA CATAGAATTA GAAGGAGGAA ATATCACGGC TTCCGGAGGA CGAATAGAAC TTTGGTCAAT CAACAATGGC TCTGTTGAAA TCTCCAATAT CAACGGACAG ATACAACTGA CTTCTAGTGC CCAAACCGTT GAATCTGGCA ATATTCAACT TTTAGGAGCG GCTTCTGTAG ATGTAAGCGG GAGTCGAGGC GGAGATATTC AAATTCAAGC AGAGAATTTA AGTTTAAGGG AAGGCTCAGT TATTTTAGCC GATACTCTAG GAAATCAGAC CGGGGGAAAC GTGATCATTA AAGCCAATAA ATCTCTACAA CTTGCAGGAA CTTCTCCCTT TAGCCCAATT TTTAGCGGAA TTTTTACCGA TGCGGCCTCA ACCGCTACAG GAAAAGGAGG AAATTTAACC CTTGAAACCG GCTATTTACA ACTGGCCAAT GGGGCGCAAA TTAGCACGAA TACCTTTGGA GTCAGTGAGG GAGGAACATT AACCCTCAAA GCCAATCAGA TTGATTTGAT TGGGGGTTCA AGCTTTGGCC CTAGTGGGTT ATTTGCTGTA GTAGCTCCCG AAGCTACTGG CCAGGGAGGC AATTTATCAC TCATAACCGA TGGACTCTCT CTAGTGGGTG GGGCACAAGT TTCCGTTGGG ACGTTTGGCA GTGGTAACGC CGGGACTTTG ACCCTTAATG CCTCTAGGAT AAGTATCACA GGAACTTCTC CAGGAGGCAA TGCTAGCAGT ATAGTCGGAA ATGTAATCCC AACAGCAACC GGCAACGGTG GCAATATCTT TATTCAAACC AATTCTCTTA ATTTAATCGA TGGAGGACAA ATCGGCAGTG CGACTTTTGG GGTAGGAAGC GCCGGCAGAT TAACGATTAA CGCACAAGAA ATACAGATAA TCGGGGGAAC GCCCTCAGCC CCAAGCGGAG TCTACACCAC AGTTGAACCA AACGCAACCG GCAACGGAGG ATCACTGACT CTAGAAACGG AATCTTTACG CATAGTCGAT GGGGGACAAA TTGCCGTTAG TACCGCAGGA ACGGGAACAG CCGGAAATTT AGTAATACAA GCAGATACTA TACAACTGAT CGGCTCGTCA GAGTTTGGAG CTAGTGGTTT ATTTGCTAAC GCCATCGCCG ACACAGGACA AGGGGGCAAT ATTTCCTTAA CCACAGATCA ATTAATCCTC CAAGACGGAG CAACCATTAA TTTAAGTAAC TTTTCCAGTC GTAATCCCAC TATTCCTGCC GGCCAGGGTT CACCCGGTAA TCTCAACATT CGCGCCGGTT CAATTTTACT GGACAATCAA AGCACCCTTA CCGCCGCTAC CCTTGTGGGC AGTCAAGGCA ATATTACCCT TTTATCTGGG GATCTTCAAC TGCGTCAAGG CAGTCTCATT ACTACGAATG CCTCTGGCCA GGGAAGTGGC GGAAACATTA TCATTAATAG CGATCGTCTA ACGATGACCG GAAACAGCGA AATTTCTGCG AGTGCTACCG GCATAGGTGC AGGAGGAAAT GTAGAAATTA TCACCCTTTC CCCTCTATTG CTCAATCAAA GTACCATTAG TGCCACTGGA GGACAAGGAA ATATTTTCCT ACAATCTCCT GGGGTAGAAC TTCGCAACGG GAGTAATTTA AGTACCAACG GCATCGGCGA TGCTCCTGGA GGTAACATTA TCATCAATAC CGCCTATCTT TTTGCTGTTC CTCAAGAAAA TAGCGATATT ACCGCTAATG CTCAAAATAG TTTTGGCGGT CGAGTGATTA TCTTTACCGA TGGGATTTTT GGCCTCGAAG TCCGAGAAAA CCTTACCCCT CTGAGTGATA TTACGGTGTC TTCTCAATTA GGGCTAGAGT TTGGGGGACT TGTTGAGATC AACAATCAAG GGTTTGATCC CCGTTCAGCC TTAGTTCAAC TGTCCACAGA AGTGGTTGAT GCTAGCGATC AAATCGCTCA AGGTTGTGAT GCTCAACAAG GGAATGTTTT TGTAGTCACC GGACGAGGAG GACTACCCGA AAATCCCAAT CAAACCTTAC AAAGTCAGTC CCTCTGGCAA GATTTAAGAA CTGTCTCAGA AAATCCCCCT CAATCTTCCT CCTCCTCATC CGCTCAAACT CCTGACCATA GGATGATTAC AGAAGCTCAA GGATGGATTA TTAATCCTGA TGGTCAAGTC GATCTCATTG CCCAAGCCCC TACTCAAACA TCATGGCATC AACCGAGCCA ATGTCACTTA AAAGGAGGAA AAAAATAG
|
Protein sequence | MKTLSRLAIA TSCCLVIGVC SPDTAQSQIV PDTTLSNPSL VTNTSNTQEI TGGTIVDNHL FHSFEQFSVP TNTEAYFNNS LNIINIFSRI TGNSISIIDG LIKANGTANL FLLNPNGILF GPNASLNIGG AFVASTANSI QWANNQEFSA VNPQDVSPLL TMNVPMGLQY GANPGKIVVQ GTGNNLGLSE NFEIIRDFRP SGLEVQPGQS LILAGGDIEL EGGNITASGG RIELWSINNG SVEISNINGQ IQLTSSAQTV ESGNIQLLGA ASVDVSGSRG GDIQIQAENL SLREGSVILA DTLGNQTGGN VIIKANKSLQ LAGTSPFSPI FSGIFTDAAS TATGKGGNLT LETGYLQLAN GAQISTNTFG VSEGGTLTLK ANQIDLIGGS SFGPSGLFAV VAPEATGQGG NLSLITDGLS LVGGAQVSVG TFGSGNAGTL TLNASRISIT GTSPGGNASS IVGNVIPTAT GNGGNIFIQT NSLNLIDGGQ IGSATFGVGS AGRLTINAQE IQIIGGTPSA PSGVYTTVEP NATGNGGSLT LETESLRIVD GGQIAVSTAG TGTAGNLVIQ ADTIQLIGSS EFGASGLFAN AIADTGQGGN ISLTTDQLIL QDGATINLSN FSSRNPTIPA GQGSPGNLNI RAGSILLDNQ STLTAATLVG SQGNITLLSG DLQLRQGSLI TTNASGQGSG GNIIINSDRL TMTGNSEISA SATGIGAGGN VEIITLSPLL LNQSTISATG GQGNIFLQSP GVELRNGSNL STNGIGDAPG GNIIINTAYL FAVPQENSDI TANAQNSFGG RVIIFTDGIF GLEVRENLTP LSDITVSSQL GLEFGGLVEI NNQGFDPRSA LVQLSTEVVD ASDQIAQGCD AQQGNVFVVT GRGGLPENPN QTLQSQSLWQ DLRTVSENPP QSSSSSSAQT PDHRMITEAQ GWIINPDGQV DLIAQAPTQT SWHQPSQCHL KGGKK
|
| |