Gene Information |
Locus tag | PCC7424_1759 |
Symbol | |
ID | 7110417 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 7424 |
Kingdom | Bacteria |
Replicon accession | NC_011729 |
Strand | - |
Start bp | 1963244 |
End bp | 1965520 |
Gene Length | 2277 bp |
Protein Length | 758 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643480021 |
Product | filamentous hemagglutinin family outer membrane protein |
Protein accession | YP_002377062 |
Protein GI | 218438733 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3210] Large exoproteins involved in heme utilization or adhesion |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
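The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not say so explicitly) and that the CDS ends in a single stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1963244
end_bp = 1965520
gene_length_bp = 2277
protein_length_aa = 758

# Assumption: coordinates are 1-based and inclusive,
# so the span covered is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: a complete CDS has one codon per residue plus one stop codon.
expected_cds_length = (protein_length_aa + 1) * 3

print("span from coordinates:", span)
print("length implied by protein:", expected_cds_length)
print("matches Gene Length field:", span == gene_length_bp == expected_cds_length)
```

Under these assumptions, the three figures agree, which suggests the record describes an intact, unspliced CDS, consistent with the "Is pseudo gene: No" and "Is gene spliced: No" fields.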
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 0.451237 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATGTA ATCTTTTTTT TTGGCAAATT GCTGTACAAT TAGGGACTTT ACCCATTTTG ACTTTTTCAG CTATGGCTCA AGTCATTCCT GACGGAACGT TACCGACAGA AGTTGAACAA GTCAACAATG AGTATCGAAT TTTAGGGGGA GCAAAAGCAG GAAATAATTT ATTCCATAGT TTTAGGGAAT TTTCAGTTCC TCCAACATTT CAAGCTATTT TTAATAATTC CAATGATCTT GCCAATATTT TTGGGCGAGT TACGGGTAAT TCTATCTCCA TCATAGAGGG AATCATAAAA GCCAATGGGA CGGCTAACTT ATTCCTGATT AATCCTAACG GGATTATTTT TGGTCCCGAT GCTCAACTGA ATATAGGCGG GTCATTTTTT GCCAGTACCG CCAGTAGTAT CAAGTTTGCT GATGGGACTC AGTTTAGTAG TAATAATCTT TCAACTCCCC CGGTCTTAAC CGTCAGTGTG CCCATTGGCT TACAATTTGG CAATAATCCT GGTTCTATTG TCAATCGTTC TGTGAGTAAA AATGCAAACA ATGTTCCCGT TGGGTTAAAT GTTAATCCTG GGAATACTTT AGGATTAGTG GGAGGTAATA TACAGCTAGA AGGAGGACGC TTAACGGCGA TTGGGGGACG TATTGAACTA GGAAGTGTTT CTAGCCAGAG TTTTGTCAAT TTAATCCCTA TTAATCAAGG CTGGCAATTG GGATATGAAG GAGTAAGAGA TTTCTCTGAT ATCCAACTAT CTAATCAAAG TTTAGTCAAC GCTAGTGGCG CAGGGGGAAC TGCTCAACTT CAAGGGCGTA ATATTGAATT AACTACAGGA GCGCTCATTC AATTAAATGT AATAGGAAAT GAATCTTCTG GAAATTTAAT CATTAACGCT TCTGAATCTC TAGTGCTTCA ATCAACACCC TCACAAATTA CTGGTTTATC GGCATTTATG GCTCAGGATC TTGAGGGGAA TGGTGCTAAT ATTACTCTGA ACGCAAAGAA TTTAACAATT CGAGATGGAG CAGTAATTAG TTTAGTAACT GATGGACAAG GGACAGGAGG TCAGTTAACA ATTAATGCAG CTACTGTAGA AATTCTTGGA GTAGGTGATG CGATTCCTAG CTTAATAACT ACTACTACTG GTGGAAAAGC GGATGCGGGT GCAATAACTA TAAACACTGA GCGATTGATT GTCGCTAACG GAGGACAGAT TCTTGCCGTT ACAACCAATG TGGGTAATGG AGGTATTATT ACTATTAATG CCTCTGAATC TGTAGAACTC ACTGGCTCAA TACCACTTCC TTCAAGAAAC ATTATAACTC CTAGTCTTAT ATCTACTGCC TCTGGATTTG AAGATTTAAA TCAGCCAGGA ATTGGCACAG CAGGCAACAT AAACCTTAAC ACAGGACAGT TAATCATATC TGACGGGGGT CGCATTTCTA CCGGCAGTTT CCAAGGTGGA GAAGCCGGCA ACTTAACCAT CAACGCTAAT TCTATTCTCC TCAATGGAGG TCGAATCTTA GCAGAAAGTA CCTCCTCTTC CGGAGGCAAC ATATTTATTA ACGCTTCCGA CGTTTTAACC CTGCGAAATA ATAGCCTCAT CTCCGCCACC GCCGGCACCG CACAAGCAGG AGGCAACGGA GGCAACATTT TCATTAATGC TCCCTTTATT TTTGCTGTGC CCATTGAAAA TAGTGATATT CGGGCCGATG CTTTTACAGG GACAGGAGGC AACGTTACCA TTAATGAAAA TTCTATTTTT GGGATTGCCT TGAGAGACGC TCTCACCCCA 
TTAAGCGATA TTACCGTGAG TTCTCAATTT GGGCAAGCCG GAAACGTTCT GTTTAACCGT CCCGAAGTCG ACCCTCAAAC AGGTCTTGTC ACTTTCCCCT CACAAATAGG CAATGATGAA ACTGTGATTG TCCAGACTTG CGGTGTGGGG GGAGCATTTG CCAGGGGAGA ATTTATTATT ACAGGACGAG GTGGTATGGT AGAAAATCCC TATGATGCCC CTGAAATAAG TACCACTTTG GCTGATTTAG GAGAAAATGT GGCTTCTGGG TCTTCTGCCA CCTCAAAGCC TCATTATTCT GAATCTGTCC CTGTCTCTTC ACCTCCCCAG AGAATTATAG AAGCTCAAGG CTGGATTAGG GATTCCTCCG GAAATCTTAT TTTAACGGCT CAAGTCGAAA ATGTGACCCC AAGCACTTCC GGACTCAATT CAACCCATTG TGACCTTTCA GGGGGGAACT TAACCCCATC AAGGTAA
|
Protein sequence | MKCNLFFWQI AVQLGTLPIL TFSAMAQVIP DGTLPTEVEQ VNNEYRILGG AKAGNNLFHS FREFSVPPTF QAIFNNSNDL ANIFGRVTGN SISIIEGIIK ANGTANLFLI NPNGIIFGPD AQLNIGGSFF ASTASSIKFA DGTQFSSNNL STPPVLTVSV PIGLQFGNNP GSIVNRSVSK NANNVPVGLN VNPGNTLGLV GGNIQLEGGR LTAIGGRIEL GSVSSQSFVN LIPINQGWQL GYEGVRDFSD IQLSNQSLVN ASGAGGTAQL QGRNIELTTG ALIQLNVIGN ESSGNLIINA SESLVLQSTP SQITGLSAFM AQDLEGNGAN ITLNAKNLTI RDGAVISLVT DGQGTGGQLT INAATVEILG VGDAIPSLIT TTTGGKADAG AITINTERLI VANGGQILAV TTNVGNGGII TINASESVEL TGSIPLPSRN IITPSLISTA SGFEDLNQPG IGTAGNINLN TGQLIISDGG RISTGSFQGG EAGNLTINAN SILLNGGRIL AESTSSSGGN IFINASDVLT LRNNSLISAT AGTAQAGGNG GNIFINAPFI FAVPIENSDI RADAFTGTGG NVTINENSIF GIALRDALTP LSDITVSSQF GQAGNVLFNR PEVDPQTGLV TFPSQIGNDE TVIVQTCGVG GAFARGEFII TGRGGMVENP YDAPEISTTL ADLGENVASG SSATSKPHYS ESVPVSSPPQ RIIEAQGWIR DSSGNLILTA QVENVTPSTS GLNSTHCDLS GGNLTPSR
|
| |
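The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of that cleanup and of a GC-fraction calculation like the one behind the GC content field. The function names `clean_sequence` and `gc_fraction` are hypothetical helpers, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two display blocks of the gene sequence, used as a small example.
first_blocks = "ATGAAATGTA ATCTTTTTTT"
print(len(clean_sequence(first_blocks)))  # number of bases after cleanup
print(gc_fraction(first_blocks))          # GC fraction of this fragment only
```

Applying `gc_fraction` to the full gene sequence, pasted in as one string, would give a value to compare with the 42% reported in the GC content field.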