Gene PCC7424_5640 details


Gene Information

Locus tag: PCC7424_5640
Symbol:
ID: 7112967
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 7424
Kingdom: Bacteria
Replicon accession: NC_011738
Strand:
Start bp: 18722
End bp: 21190
Gene Length: 2469 bp
Protein Length: 822 aa
Translation table: 11
GC content: 36%
IMG OID: 643483937
Product: filamentous hemagglutinin family outer membrane protein
Protein accession: YP_002380946
Protein GI: 218442626
COG category:
COG ID:
TIGRFAM ID: [TIGR01901] filamentous haemagglutinin family N-terminal domain
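
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the stop codon; neither convention is stated in the record. The function names are illustrative and do not come from any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 18722, 21190

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 2469
print(protein_length_aa(length))  # 822
```

Under these assumptions both results agree with the Gene Length (2469 bp) and Protein Length (822 aa) fields.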


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.00258784
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAAAAA TAACTAGCTT TTTAAGTGGC TCTCTTTTCA ATCTACTCTT TTTGAGTACA 
GTAGCTCAAG GTCAAATCAC TCCCGATCAA ACCTTGCCGA ATTCTTCCTT CGTTAACGAG
AACGGGAATT TACTTTTAAT TGAAGGAGGA ACCGAAAGGG GAAATAATTT ATTTCACAGT
TTTACTGAAT TTTCTTTACC TACGGGACAA ATCGCTTTTT TCAATAATGA TATTTTAATC
CAAAATATTT TTTCTCGTGT AACAGGCAAC TCCATTTCTC ATTTAGATGG AATCATCAAA
GCTAATGGTT TGGCAAATTT ATTTTTTTTA AATCCGAACG GGATTATTTT TGGGCCTAAT
GCTCGTCTAG CAATTGGTGG CTCATTTATT GCCACGACCG CGAATAGTAT TATATTTGAT
AATGGGTATC GATTCAGCGC AACCGATCCT AATTCTTCTT TTCCTGTTAC GATTAATGTC
CCTATCGGAT TAGATTTTAG TCCGGAAACT AATTTAGCGG GAATCATTGA AGGCTTTGGA
ACAGGACATT CTTTTGCCGG TCTTGAGCAG CCACAATTCG GAGGAACTCA ATTTTTAAAA
GGCTTAAATG TTGCTTCTGG AAAAACCCTT GCTTTTATTG GTAGAGAAAT TAATTTTAAT
GGATTTGTGG CAACGGCTGG ACAAGTAGTT TTTGATGTTT CTTCACAACA ATTAAATCCT
TCAATTTCAG GAAATCTTGA GATAGCCAGT ATTAAAGAAG GATTAATCGG CTTAGATTTT
AATAATTTAT CGGAGATCAA TTTTAATTAT AATCAAGTTC TCTCTTTTGG TAATGTTACT
TTAGCTCAAA AATCTTTATT AGATGTCAGT GGGTTAAATG GAGGAAATTT ACAGATTAGA
GCTAAAGATT TGACTGTTAC TGATGCTAGT CTTTTAATGA ATTCTAACTA TGGCGATCAA
ATGTCTTCTG TCGGGTCTAT TAATATTAAT TTAACTGGAC ACTTAAATTT AGTCGGAGTA
ACTGGTTTTA ATACCGATAA TACGGCTGAA AATACCACGA TAAGAGGAAT AATTTCTCAA
AATTTTGCCG AAGCCAACTC TCCTAATATT TTTTTAAATG CTCAAAACAT TATTTTATCA
GATACGGCTG GGATTCTTAC TTTAGCAATT GGGGCTGGAA AAAGCGGAGA CATATTTATT
ACAGCTAAAG AGTCTTTAAT TGTCGGAAAA AGATCGCCGT TTGAGCCTTT TGCTGTTGGC
AGTCAAATTA TTACACAAAG TTCATCAACT ATGTTTACCG GTGGCGGACA AGGTGGTAAT
ATTTATATTC AATCTCCCCA AGTTATTTTA AAGGATGGCG GTTCTATTCA AACCGTTACT
TATGGGTTAA ATCCAAGCGG AAATATTGAG ATTAAAACTA ACGATATTAG TATTAGTGGC
TTTGTTCCGG CGGATATGGG ATTTTATCCG ACCAGCTTAG GAACAGTAAC ACGGGGACTG
GGAAAATCAG GAAATACAGT GATATCTACT GATAGGTTAA CTGTAACTAA TGGAGCTAGA
ATTAATTCTA CTTCTTTAGG AGCCGGAAAC GGAGGGGATA TTATCATTAA TGCAAACCAG
GGCGTTTTGA TTGAAGATAC TATTTATAGT GGACAGGATT CCTCTAAAAT AATTGCTTCC
GCTAATCAAC TTAACGAATT ATTTTACGAA GTTTTTAAAC TGCCTCGCAA TTTAACGGGA
AATTCTGGAC GAATTTTTTT AAATACGCCC AATTTACAAC TAGCTGATGG AGGTAAAATT
ACTGTTCAAA ATGATGGAAC GGGAGCAGCA GGAATTATCG ATATTACAAG CGAGAATCTA
ACTCTGATTA ATTCTGCTTC TATCGATGCC AGTACGGTTT CAGGAGAAGG AGGAACTATT
TTAATTGATT CAAAATTTAT TCACCTTAAT TCTTCTGCTA TTACGACAAC AGCCGGAGGA
TTAGGAAACG GCGGGGATAT TTCTCTGACA ACCGACTCTC TCATTTTGTC AAATAATAGT
GGAATACAAG CTAATGCTTT TGCCGGCAGA GGAGGAAATA TTGACATTGA AACAAAAGGG
TTTTTTCTAT CTTCCAATAG CCAAATTACA GCTAGCTCTG AATTAGGAAT TGAGGGGGCA
ATTACCATTA ACAATTTTCC TTTTCAGATA CAAGGGGAGC AAGCACAACT TCCTACTCCA
CTCTCTTTAT CAGAAATAGC CGCACAAAGC TGTATTGCTT ATAAATCCCA AACCTATAAA
GTGACCATTC GGGGTCAAGG GACTAATCAA GGCGATCTCA GTTCTCCTAG AGGCTATAAT
TTTTTTGATT TAATCCCTCA AGGACAGGTT GTGGCGGCTG AAAATACATC CTCTGGTGTC
AGACTCCTCA ATTGCGATCA ATATTGGGAG AAGTTACAAC AACAAGATCA AACTCCTGGC
GTTCCTTAA
 
Protein sequence
MKKITSFLSG SLFNLLFLST VAQGQITPDQ TLPNSSFVNE NGNLLLIEGG TERGNNLFHS 
FTEFSLPTGQ IAFFNNDILI QNIFSRVTGN SISHLDGIIK ANGLANLFFL NPNGIIFGPN
ARLAIGGSFI ATTANSIIFD NGYRFSATDP NSSFPVTINV PIGLDFSPET NLAGIIEGFG
TGHSFAGLEQ PQFGGTQFLK GLNVASGKTL AFIGREINFN GFVATAGQVV FDVSSQQLNP
SISGNLEIAS IKEGLIGLDF NNLSEINFNY NQVLSFGNVT LAQKSLLDVS GLNGGNLQIR
AKDLTVTDAS LLMNSNYGDQ MSSVGSININ LTGHLNLVGV TGFNTDNTAE NTTIRGIISQ
NFAEANSPNI FLNAQNIILS DTAGILTLAI GAGKSGDIFI TAKESLIVGK RSPFEPFAVG
SQIITQSSST MFTGGGQGGN IYIQSPQVIL KDGGSIQTVT YGLNPSGNIE IKTNDISISG
FVPADMGFYP TSLGTVTRGL GKSGNTVIST DRLTVTNGAR INSTSLGAGN GGDIIINANQ
GVLIEDTIYS GQDSSKIIAS ANQLNELFYE VFKLPRNLTG NSGRIFLNTP NLQLADGGKI
TVQNDGTGAA GIIDITSENL TLINSASIDA STVSGEGGTI LIDSKFIHLN SSAITTTAGG
LGNGGDISLT TDSLILSNNS GIQANAFAGR GGNIDIETKG FFLSSNSQIT ASSELGIEGA
ITINNFPFQI QGEQAQLPTP LSLSEIAAQS CIAYKSQTYK VTIRGQGTNQ GDLSSPRGYN
FFDLIPQGQV VAAENTSSGV RLLNCDQYWE KLQQQDQTPG VP
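
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such text could be cleaned, translated, and scored for GC content follows. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon assignments used are those of the standard genetic code, which translation table 11 shares (table 11 differs in which start codons it allows). Only the first printed row of the gene sequence is used, to keep the example short.

```python
from itertools import product

# Standard codon assignments, bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(seq_text.split()).upper()


def translate(dna: str) -> str:
    """Translate frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First printed row of the gene sequence above.
first_row = "ATGAAAAAAA TAACTAGCTT TTTAAGTGGC TCTCTTTTCA ATCTACTCTT TTTGAGTACA"
dna = clean(first_row)
print(translate(dna))  # MKKITSFLSGSLFNLLFLST
```

The printed peptide matches the first two blocks of the protein sequence above. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare with the GC content field.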