Gene PCC7424_1755 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC7424_1755 
Symbol 
ID7110413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 7424 
KingdomBacteria 
Replicon accessionNC_011729 
Strand
Start bp1954577 
End bp1957474 
Gene Length2898 bp 
Protein Length965 aa 
Translation table11 
GC content44% 
IMG OID643480017 
Productfilamentous hemagglutinin family outer membrane protein 
Protein accessionYP_002377058 
Protein GI218438729 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3210] Large exoproteins involved in heme utilization or adhesion 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value0.655087 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACTT TATCCCGATT GGCTATCGCT ACAAGCTGCT GTCTTGTGAT AGGAGTATGT 
TCTCCAGACA CAGCCCAGAG CCAAATCGTA CCAGACACGA CCTTATCTAA TCCCTCCCTG
GTCACCAATA CGAGTAATAC TCAAGAAATT ACAGGAGGAA CAATAGTCGA TAATCATCTT
TTTCATAGCT TTGAACAATT TTCTGTTCCT ACAAACACTG AAGCTTATTT TAATAATAGT
TTAAATATTA TTAATATTTT TAGTCGCATT ACGGGCAATT CTATTTCTAT AATTGATGGA
TTAATTAAAG CCAATGGGAC AGCGAATTTG TTTTTGCTCA ATCCCAATGG AATTCTTTTC
GGCCCTAATG CTTCCTTAAA TATTGGAGGT GCATTTGTGG CTAGCACAGC TAATAGTATT
CAATGGGCTA ATAATCAAGA ATTTAGTGCA GTTAATCCCC AAGATGTCTC CCCTTTATTA
ACGATGAACG TACCAATGGG GCTACAATAC GGAGCTAATC CAGGCAAAAT AGTCGTTCAA
GGAACGGGAA ATAATTTAGG ACTCAGTGAG AATTTTGAAA TTATTCGGGA TTTTCGCCCC
TCTGGCCTAG AAGTTCAGCC GGGTCAAAGC TTAATTTTAG CGGGAGGAGA CATAGAATTA
GAAGGAGGAA ATATCACGGC TTCCGGAGGA CGAATAGAAC TTTGGTCAAT CAACAATGGC
TCTGTTGAAA TCTCCAATAT CAACGGACAG ATACAACTGA CTTCTAGTGC CCAAACCGTT
GAATCTGGCA ATATTCAACT TTTAGGAGCG GCTTCTGTAG ATGTAAGCGG GAGTCGAGGC
GGAGATATTC AAATTCAAGC AGAGAATTTA AGTTTAAGGG AAGGCTCAGT TATTTTAGCC
GATACTCTAG GAAATCAGAC CGGGGGAAAC GTGATCATTA AAGCCAATAA ATCTCTACAA
CTTGCAGGAA CTTCTCCCTT TAGCCCAATT TTTAGCGGAA TTTTTACCGA TGCGGCCTCA
ACCGCTACAG GAAAAGGAGG AAATTTAACC CTTGAAACCG GCTATTTACA ACTGGCCAAT
GGGGCGCAAA TTAGCACGAA TACCTTTGGA GTCAGTGAGG GAGGAACATT AACCCTCAAA
GCCAATCAGA TTGATTTGAT TGGGGGTTCA AGCTTTGGCC CTAGTGGGTT ATTTGCTGTA
GTAGCTCCCG AAGCTACTGG CCAGGGAGGC AATTTATCAC TCATAACCGA TGGACTCTCT
CTAGTGGGTG GGGCACAAGT TTCCGTTGGG ACGTTTGGCA GTGGTAACGC CGGGACTTTG
ACCCTTAATG CCTCTAGGAT AAGTATCACA GGAACTTCTC CAGGAGGCAA TGCTAGCAGT
ATAGTCGGAA ATGTAATCCC AACAGCAACC GGCAACGGTG GCAATATCTT TATTCAAACC
AATTCTCTTA ATTTAATCGA TGGAGGACAA ATCGGCAGTG CGACTTTTGG GGTAGGAAGC
GCCGGCAGAT TAACGATTAA CGCACAAGAA ATACAGATAA TCGGGGGAAC GCCCTCAGCC
CCAAGCGGAG TCTACACCAC AGTTGAACCA AACGCAACCG GCAACGGAGG ATCACTGACT
CTAGAAACGG AATCTTTACG CATAGTCGAT GGGGGACAAA TTGCCGTTAG TACCGCAGGA
ACGGGAACAG CCGGAAATTT AGTAATACAA GCAGATACTA TACAACTGAT CGGCTCGTCA
GAGTTTGGAG CTAGTGGTTT ATTTGCTAAC GCCATCGCCG ACACAGGACA AGGGGGCAAT
ATTTCCTTAA CCACAGATCA ATTAATCCTC CAAGACGGAG CAACCATTAA TTTAAGTAAC
TTTTCCAGTC GTAATCCCAC TATTCCTGCC GGCCAGGGTT CACCCGGTAA TCTCAACATT
CGCGCCGGTT CAATTTTACT GGACAATCAA AGCACCCTTA CCGCCGCTAC CCTTGTGGGC
AGTCAAGGCA ATATTACCCT TTTATCTGGG GATCTTCAAC TGCGTCAAGG CAGTCTCATT
ACTACGAATG CCTCTGGCCA GGGAAGTGGC GGAAACATTA TCATTAATAG CGATCGTCTA
ACGATGACCG GAAACAGCGA AATTTCTGCG AGTGCTACCG GCATAGGTGC AGGAGGAAAT
GTAGAAATTA TCACCCTTTC CCCTCTATTG CTCAATCAAA GTACCATTAG TGCCACTGGA
GGACAAGGAA ATATTTTCCT ACAATCTCCT GGGGTAGAAC TTCGCAACGG GAGTAATTTA
AGTACCAACG GCATCGGCGA TGCTCCTGGA GGTAACATTA TCATCAATAC CGCCTATCTT
TTTGCTGTTC CTCAAGAAAA TAGCGATATT ACCGCTAATG CTCAAAATAG TTTTGGCGGT
CGAGTGATTA TCTTTACCGA TGGGATTTTT GGCCTCGAAG TCCGAGAAAA CCTTACCCCT
CTGAGTGATA TTACGGTGTC TTCTCAATTA GGGCTAGAGT TTGGGGGACT TGTTGAGATC
AACAATCAAG GGTTTGATCC CCGTTCAGCC TTAGTTCAAC TGTCCACAGA AGTGGTTGAT
GCTAGCGATC AAATCGCTCA AGGTTGTGAT GCTCAACAAG GGAATGTTTT TGTAGTCACC
GGACGAGGAG GACTACCCGA AAATCCCAAT CAAACCTTAC AAAGTCAGTC CCTCTGGCAA
GATTTAAGAA CTGTCTCAGA AAATCCCCCT CAATCTTCCT CCTCCTCATC CGCTCAAACT
CCTGACCATA GGATGATTAC AGAAGCTCAA GGATGGATTA TTAATCCTGA TGGTCAAGTC
GATCTCATTG CCCAAGCCCC TACTCAAACA TCATGGCATC AACCGAGCCA ATGTCACTTA
AAAGGAGGAA AAAAATAG
 
Protein sequence
MKTLSRLAIA TSCCLVIGVC SPDTAQSQIV PDTTLSNPSL VTNTSNTQEI TGGTIVDNHL 
FHSFEQFSVP TNTEAYFNNS LNIINIFSRI TGNSISIIDG LIKANGTANL FLLNPNGILF
GPNASLNIGG AFVASTANSI QWANNQEFSA VNPQDVSPLL TMNVPMGLQY GANPGKIVVQ
GTGNNLGLSE NFEIIRDFRP SGLEVQPGQS LILAGGDIEL EGGNITASGG RIELWSINNG
SVEISNINGQ IQLTSSAQTV ESGNIQLLGA ASVDVSGSRG GDIQIQAENL SLREGSVILA
DTLGNQTGGN VIIKANKSLQ LAGTSPFSPI FSGIFTDAAS TATGKGGNLT LETGYLQLAN
GAQISTNTFG VSEGGTLTLK ANQIDLIGGS SFGPSGLFAV VAPEATGQGG NLSLITDGLS
LVGGAQVSVG TFGSGNAGTL TLNASRISIT GTSPGGNASS IVGNVIPTAT GNGGNIFIQT
NSLNLIDGGQ IGSATFGVGS AGRLTINAQE IQIIGGTPSA PSGVYTTVEP NATGNGGSLT
LETESLRIVD GGQIAVSTAG TGTAGNLVIQ ADTIQLIGSS EFGASGLFAN AIADTGQGGN
ISLTTDQLIL QDGATINLSN FSSRNPTIPA GQGSPGNLNI RAGSILLDNQ STLTAATLVG
SQGNITLLSG DLQLRQGSLI TTNASGQGSG GNIIINSDRL TMTGNSEISA SATGIGAGGN
VEIITLSPLL LNQSTISATG GQGNIFLQSP GVELRNGSNL STNGIGDAPG GNIIINTAYL
FAVPQENSDI TANAQNSFGG RVIIFTDGIF GLEVRENLTP LSDITVSSQL GLEFGGLVEI
NNQGFDPRSA LVQLSTEVVD ASDQIAQGCD AQQGNVFVVT GRGGLPENPN QTLQSQSLWQ
DLRTVSENPP QSSSSSSAQT PDHRMITEAQ GWIINPDGQV DLIAQAPTQT SWHQPSQCHL
KGGKK