Gene PCC7424_1759 details


Gene Information

Locus tag: PCC7424_1759
Symbol:
ID: 7110417
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 7424
Kingdom: Bacteria
Replicon accession: NC_011729
Strand:
Start bp: 1963244
End bp: 1965520
Gene Length: 2277 bp
Protein Length: 758 aa
Translation table: 11
GC content: 42%
IMG OID: 643480021
Product: filamentous hemagglutinin family outer membrane protein
Protein accession: YP_002377062
Protein GI: 218438733
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3210] Large exoproteins involved in heme utilization or adhesion
TIGRFAM ID: [TIGR01901] filamentous haemagglutinin family N-terminal domain
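
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1963244
end_bp = 1965520
gene_length_bp = 2277
protein_length_aa = 758

# Assumption: coordinates are inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the last one is assumed to be
# the stop codon, which does not contribute an amino acid.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # prints: 2277 0 758
```

Under these assumptions the computed span and protein length agree with the values listed in the record.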


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value: 0.451237
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATGTA ATCTTTTTTT TTGGCAAATT GCTGTACAAT TAGGGACTTT ACCCATTTTG 
ACTTTTTCAG CTATGGCTCA AGTCATTCCT GACGGAACGT TACCGACAGA AGTTGAACAA
GTCAACAATG AGTATCGAAT TTTAGGGGGA GCAAAAGCAG GAAATAATTT ATTCCATAGT
TTTAGGGAAT TTTCAGTTCC TCCAACATTT CAAGCTATTT TTAATAATTC CAATGATCTT
GCCAATATTT TTGGGCGAGT TACGGGTAAT TCTATCTCCA TCATAGAGGG AATCATAAAA
GCCAATGGGA CGGCTAACTT ATTCCTGATT AATCCTAACG GGATTATTTT TGGTCCCGAT
GCTCAACTGA ATATAGGCGG GTCATTTTTT GCCAGTACCG CCAGTAGTAT CAAGTTTGCT
GATGGGACTC AGTTTAGTAG TAATAATCTT TCAACTCCCC CGGTCTTAAC CGTCAGTGTG
CCCATTGGCT TACAATTTGG CAATAATCCT GGTTCTATTG TCAATCGTTC TGTGAGTAAA
AATGCAAACA ATGTTCCCGT TGGGTTAAAT GTTAATCCTG GGAATACTTT AGGATTAGTG
GGAGGTAATA TACAGCTAGA AGGAGGACGC TTAACGGCGA TTGGGGGACG TATTGAACTA
GGAAGTGTTT CTAGCCAGAG TTTTGTCAAT TTAATCCCTA TTAATCAAGG CTGGCAATTG
GGATATGAAG GAGTAAGAGA TTTCTCTGAT ATCCAACTAT CTAATCAAAG TTTAGTCAAC
GCTAGTGGCG CAGGGGGAAC TGCTCAACTT CAAGGGCGTA ATATTGAATT AACTACAGGA
GCGCTCATTC AATTAAATGT AATAGGAAAT GAATCTTCTG GAAATTTAAT CATTAACGCT
TCTGAATCTC TAGTGCTTCA ATCAACACCC TCACAAATTA CTGGTTTATC GGCATTTATG
GCTCAGGATC TTGAGGGGAA TGGTGCTAAT ATTACTCTGA ACGCAAAGAA TTTAACAATT
CGAGATGGAG CAGTAATTAG TTTAGTAACT GATGGACAAG GGACAGGAGG TCAGTTAACA
ATTAATGCAG CTACTGTAGA AATTCTTGGA GTAGGTGATG CGATTCCTAG CTTAATAACT
ACTACTACTG GTGGAAAAGC GGATGCGGGT GCAATAACTA TAAACACTGA GCGATTGATT
GTCGCTAACG GAGGACAGAT TCTTGCCGTT ACAACCAATG TGGGTAATGG AGGTATTATT
ACTATTAATG CCTCTGAATC TGTAGAACTC ACTGGCTCAA TACCACTTCC TTCAAGAAAC
ATTATAACTC CTAGTCTTAT ATCTACTGCC TCTGGATTTG AAGATTTAAA TCAGCCAGGA
ATTGGCACAG CAGGCAACAT AAACCTTAAC ACAGGACAGT TAATCATATC TGACGGGGGT
CGCATTTCTA CCGGCAGTTT CCAAGGTGGA GAAGCCGGCA ACTTAACCAT CAACGCTAAT
TCTATTCTCC TCAATGGAGG TCGAATCTTA GCAGAAAGTA CCTCCTCTTC CGGAGGCAAC
ATATTTATTA ACGCTTCCGA CGTTTTAACC CTGCGAAATA ATAGCCTCAT CTCCGCCACC
GCCGGCACCG CACAAGCAGG AGGCAACGGA GGCAACATTT TCATTAATGC TCCCTTTATT
TTTGCTGTGC CCATTGAAAA TAGTGATATT CGGGCCGATG CTTTTACAGG GACAGGAGGC
AACGTTACCA TTAATGAAAA TTCTATTTTT GGGATTGCCT TGAGAGACGC TCTCACCCCA
TTAAGCGATA TTACCGTGAG TTCTCAATTT GGGCAAGCCG GAAACGTTCT GTTTAACCGT
CCCGAAGTCG ACCCTCAAAC AGGTCTTGTC ACTTTCCCCT CACAAATAGG CAATGATGAA
ACTGTGATTG TCCAGACTTG CGGTGTGGGG GGAGCATTTG CCAGGGGAGA ATTTATTATT
ACAGGACGAG GTGGTATGGT AGAAAATCCC TATGATGCCC CTGAAATAAG TACCACTTTG
GCTGATTTAG GAGAAAATGT GGCTTCTGGG TCTTCTGCCA CCTCAAAGCC TCATTATTCT
GAATCTGTCC CTGTCTCTTC ACCTCCCCAG AGAATTATAG AAGCTCAAGG CTGGATTAGG
GATTCCTCCG GAAATCTTAT TTTAACGGCT CAAGTCGAAA ATGTGACCCC AAGCACTTCC
GGACTCAATT CAACCCATTG TGACCTTTCA GGGGGGAACT TAACCCCATC AAGGTAA
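
The gene sequence above is displayed in blocks of 10 bases separated by spaces. The following is a hedged sketch of how such a display could be collapsed into a contiguous string and its GC percentage computed. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first display line is used as sample input. No claim is made here about how the record's own "GC content" value was derived.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-base display blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First display line of the gene sequence, copied from the record.
first_line = "ATGAAATGTA ATCTTTTTTT TTGGCAAATT GCTGTACAAT TAGGGACTTT ACCCATTTTG"
seq = clean_sequence(first_line)

print(len(seq))                    # 6 blocks of 10 bases -> 60
print(round(gc_percent(seq), 1))   # GC percentage of this line only
```

To check the full gene, paste all sequence lines into one string and pass it through the same two functions.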
 
Protein sequence
MKCNLFFWQI AVQLGTLPIL TFSAMAQVIP DGTLPTEVEQ VNNEYRILGG AKAGNNLFHS 
FREFSVPPTF QAIFNNSNDL ANIFGRVTGN SISIIEGIIK ANGTANLFLI NPNGIIFGPD
AQLNIGGSFF ASTASSIKFA DGTQFSSNNL STPPVLTVSV PIGLQFGNNP GSIVNRSVSK
NANNVPVGLN VNPGNTLGLV GGNIQLEGGR LTAIGGRIEL GSVSSQSFVN LIPINQGWQL
GYEGVRDFSD IQLSNQSLVN ASGAGGTAQL QGRNIELTTG ALIQLNVIGN ESSGNLIINA
SESLVLQSTP SQITGLSAFM AQDLEGNGAN ITLNAKNLTI RDGAVISLVT DGQGTGGQLT
INAATVEILG VGDAIPSLIT TTTGGKADAG AITINTERLI VANGGQILAV TTNVGNGGII
TINASESVEL TGSIPLPSRN IITPSLISTA SGFEDLNQPG IGTAGNINLN TGQLIISDGG
RISTGSFQGG EAGNLTINAN SILLNGGRIL AESTSSSGGN IFINASDVLT LRNNSLISAT
AGTAQAGGNG GNIFINAPFI FAVPIENSDI RADAFTGTGG NVTINENSIF GIALRDALTP
LSDITVSSQF GQAGNVLFNR PEVDPQTGLV TFPSQIGNDE TVIVQTCGVG GAFARGEFII
TGRGGMVENP YDAPEISTTL ADLGENVASG SSATSKPHYS ESVPVSSPPQ RIIEAQGWIR
DSSGNLILTA QVENVTPSTS GLNSTHCDLS GGNLTPSR
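
The protein sequence above corresponds to the gene sequence read codon by codon. The following is a minimal sketch of that translation, assuming the codon-to-amino-acid assignments of NCBI translation table 11 are the same as those of the standard code (the two tables differ in which codons may act as start codons). The `translate` helper is hypothetical and handles only the forward reading frame starting at the first base.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G, first position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate frame 1 of dna, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGAAATGTA ATCTTTTTTT TTGGCAAATT"))  # prints: MKCNLFFWQI
```

The output matches the first 10 residues of the protein sequence listed above.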