Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC7424_5640 |
Symbol | |
ID | 7112967 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 7424 |
Kingdom | Bacteria |
Replicon accession | NC_011738 |
Strand | - |
Start bp | 18722 |
End bp | 21190 |
Gene Length | 2469 bp |
Protein Length | 822 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643483937 |
Product | filamentous hemagglutinin family outer membrane protein |
Protein accession | YP_002380946 |
Protein GI | 218442626 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.00258784 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAAAAA TAACTAGCTT TTTAAGTGGC TCTCTTTTCA ATCTACTCTT TTTGAGTACA GTAGCTCAAG GTCAAATCAC TCCCGATCAA ACCTTGCCGA ATTCTTCCTT CGTTAACGAG AACGGGAATT TACTTTTAAT TGAAGGAGGA ACCGAAAGGG GAAATAATTT ATTTCACAGT TTTACTGAAT TTTCTTTACC TACGGGACAA ATCGCTTTTT TCAATAATGA TATTTTAATC CAAAATATTT TTTCTCGTGT AACAGGCAAC TCCATTTCTC ATTTAGATGG AATCATCAAA GCTAATGGTT TGGCAAATTT ATTTTTTTTA AATCCGAACG GGATTATTTT TGGGCCTAAT GCTCGTCTAG CAATTGGTGG CTCATTTATT GCCACGACCG CGAATAGTAT TATATTTGAT AATGGGTATC GATTCAGCGC AACCGATCCT AATTCTTCTT TTCCTGTTAC GATTAATGTC CCTATCGGAT TAGATTTTAG TCCGGAAACT AATTTAGCGG GAATCATTGA AGGCTTTGGA ACAGGACATT CTTTTGCCGG TCTTGAGCAG CCACAATTCG GAGGAACTCA ATTTTTAAAA GGCTTAAATG TTGCTTCTGG AAAAACCCTT GCTTTTATTG GTAGAGAAAT TAATTTTAAT GGATTTGTGG CAACGGCTGG ACAAGTAGTT TTTGATGTTT CTTCACAACA ATTAAATCCT TCAATTTCAG GAAATCTTGA GATAGCCAGT ATTAAAGAAG GATTAATCGG CTTAGATTTT AATAATTTAT CGGAGATCAA TTTTAATTAT AATCAAGTTC TCTCTTTTGG TAATGTTACT TTAGCTCAAA AATCTTTATT AGATGTCAGT GGGTTAAATG GAGGAAATTT ACAGATTAGA GCTAAAGATT TGACTGTTAC TGATGCTAGT CTTTTAATGA ATTCTAACTA TGGCGATCAA ATGTCTTCTG TCGGGTCTAT TAATATTAAT TTAACTGGAC ACTTAAATTT AGTCGGAGTA ACTGGTTTTA ATACCGATAA TACGGCTGAA AATACCACGA TAAGAGGAAT AATTTCTCAA AATTTTGCCG AAGCCAACTC TCCTAATATT TTTTTAAATG CTCAAAACAT TATTTTATCA GATACGGCTG GGATTCTTAC TTTAGCAATT GGGGCTGGAA AAAGCGGAGA CATATTTATT ACAGCTAAAG AGTCTTTAAT TGTCGGAAAA AGATCGCCGT TTGAGCCTTT TGCTGTTGGC AGTCAAATTA TTACACAAAG TTCATCAACT ATGTTTACCG GTGGCGGACA AGGTGGTAAT ATTTATATTC AATCTCCCCA AGTTATTTTA AAGGATGGCG GTTCTATTCA AACCGTTACT TATGGGTTAA ATCCAAGCGG AAATATTGAG ATTAAAACTA ACGATATTAG TATTAGTGGC TTTGTTCCGG CGGATATGGG ATTTTATCCG ACCAGCTTAG GAACAGTAAC ACGGGGACTG GGAAAATCAG GAAATACAGT GATATCTACT GATAGGTTAA CTGTAACTAA TGGAGCTAGA ATTAATTCTA CTTCTTTAGG AGCCGGAAAC GGAGGGGATA TTATCATTAA TGCAAACCAG GGCGTTTTGA TTGAAGATAC TATTTATAGT GGACAGGATT CCTCTAAAAT AATTGCTTCC GCTAATCAAC TTAACGAATT ATTTTACGAA GTTTTTAAAC TGCCTCGCAA TTTAACGGGA AATTCTGGAC GAATTTTTTT AAATACGCCC AATTTACAAC TAGCTGATGG AGGTAAAATT ACTGTTCAAA ATGATGGAAC GGGAGCAGCA GGAATTATCG ATATTACAAG CGAGAATCTA ACTCTGATTA ATTCTGCTTC TATCGATGCC AGTACGGTTT CAGGAGAAGG AGGAACTATT TTAATTGATT CAAAATTTAT TCACCTTAAT TCTTCTGCTA TTACGACAAC AGCCGGAGGA TTAGGAAACG GCGGGGATAT TTCTCTGACA ACCGACTCTC TCATTTTGTC AAATAATAGT GGAATACAAG CTAATGCTTT TGCCGGCAGA GGAGGAAATA TTGACATTGA AACAAAAGGG TTTTTTCTAT CTTCCAATAG CCAAATTACA GCTAGCTCTG AATTAGGAAT TGAGGGGGCA ATTACCATTA ACAATTTTCC TTTTCAGATA CAAGGGGAGC AAGCACAACT TCCTACTCCA CTCTCTTTAT CAGAAATAGC CGCACAAAGC TGTATTGCTT ATAAATCCCA AACCTATAAA GTGACCATTC GGGGTCAAGG GACTAATCAA GGCGATCTCA GTTCTCCTAG AGGCTATAAT TTTTTTGATT TAATCCCTCA AGGACAGGTT GTGGCGGCTG AAAATACATC CTCTGGTGTC AGACTCCTCA ATTGCGATCA ATATTGGGAG AAGTTACAAC AACAAGATCA AACTCCTGGC GTTCCTTAA
|
Protein sequence | MKKITSFLSG SLFNLLFLST VAQGQITPDQ TLPNSSFVNE NGNLLLIEGG TERGNNLFHS FTEFSLPTGQ IAFFNNDILI QNIFSRVTGN SISHLDGIIK ANGLANLFFL NPNGIIFGPN ARLAIGGSFI ATTANSIIFD NGYRFSATDP NSSFPVTINV PIGLDFSPET NLAGIIEGFG TGHSFAGLEQ PQFGGTQFLK GLNVASGKTL AFIGREINFN GFVATAGQVV FDVSSQQLNP SISGNLEIAS IKEGLIGLDF NNLSEINFNY NQVLSFGNVT LAQKSLLDVS GLNGGNLQIR AKDLTVTDAS LLMNSNYGDQ MSSVGSININ LTGHLNLVGV TGFNTDNTAE NTTIRGIISQ NFAEANSPNI FLNAQNIILS DTAGILTLAI GAGKSGDIFI TAKESLIVGK RSPFEPFAVG SQIITQSSST MFTGGGQGGN IYIQSPQVIL KDGGSIQTVT YGLNPSGNIE IKTNDISISG FVPADMGFYP TSLGTVTRGL GKSGNTVIST DRLTVTNGAR INSTSLGAGN GGDIIINANQ GVLIEDTIYS GQDSSKIIAS ANQLNELFYE VFKLPRNLTG NSGRIFLNTP NLQLADGGKI TVQNDGTGAA GIIDITSENL TLINSASIDA STVSGEGGTI LIDSKFIHLN SSAITTTAGG LGNGGDISLT TDSLILSNNS GIQANAFAGR GGNIDIETKG FFLSSNSQIT ASSELGIEGA ITINNFPFQI QGEQAQLPTP LSLSEIAAQS CIAYKSQTYK VTIRGQGTNQ GDLSSPRGYN FFDLIPQGQV VAAENTSSGV RLLNCDQYWE KLQQQDQTPG VP
|
| |