Gene PCC7424_4047 details


Gene Information

Locus tag: PCC7424_4047
Symbol:
ID: 7107293
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 7424
Kingdom: Bacteria
Replicon accession: NC_011729
Strand:
Start bp: 4484125
End bp: 4486368
Gene Length: 2244 bp
Protein Length: 747 aa
Translation table: 11
GC content: 42%
IMG OID: 643482272
Product: glycoside hydrolase family 57
Protein accession: YP_002379289
Protein GI: 218440960
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1449] Alpha-amylase/alpha-mannosidase
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
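
The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4484125
end_bp = 4486368

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends in exactly one stop codon,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 2244 0 747
```

Under these assumptions the result agrees with the listed gene length (2244 bp) and protein length (747 aa).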


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value: 0.422577
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTTACC CTTTATACGT TGCTTTTATT TGGCATCAAC ACCAACCTCT GTATAAGTCA 
CCAGAGGTAG CTAGTGATGC TTCTGGACAG TATCGTCTTC CTTGGGTGCG TCTTCATGGG
ACTAAGGATT ATTTAGATTT AGTCCTGATT CTAGAACGTT ATCCTAAGCT TCATCAAACC
GTTAATTTAG TTCCCTCTTT GATCTTACAA CTCGAAGAGT ATGCCGCCGG AACAGCACTA
GATCCCTACA TTGCTTTAAC GTTAACTCCA GAAGCTCAAC TGAGTGAAAA TCAGAAAAAA
TTTATTATCG AGCATTTTTT TGATGCCAAC CATCACACTC TCATCGATCC TCATCCTCGC
TACGCTCAAT TATATACTCA ACGCCAAGAA GAAGGACGGA GATGGTGTTT ATCGAACTGG
ACTGATCAAG ATTATAGTGA TTTACTGACT TGGCATAATT TAGCTTGGAT CGATCCTTTA
TTTTGGGATG ATCCAGACAT TGAAACTTGG TTAAGACAAG GACAAAATTT TACTCTAGGC
GATCGCCAAC GTCTATTTAC GAAACAAAAA GAAATTATCA GTCGCATCAT TCCCCAACAC
CGGAAGATGC AGGAAACCGG ACAATTAGAA GTCACAACTA CCCCTTATAC TCATCCCATT
TTACCCTTAC TGGCCGATAC TAATGCGGGT CTAGTGGCTG TTCCCAATAT GAGATTACCC
GAAAGGCGTT TTCAGTGGGA AGAGGATATC CCTCGTCATT TACGCAAAGC TTGGGATATT
TATCTCGATC GCTTTGGTAG AGAACCTAGG GGGTTATGGC CGTCTGAACA GTCAGTTAGT
CCGGCGATTT TACCCCATAT TAGCAAAGCC GGGTTTAAAT GGATTTGTTC TGATGAGGCT
GTATTAGGTA ATTCTCTCAA AACCTATTTC CATCGAGATG AAACCGGAAC CGTTGAAGAT
GGAGAGTTAT TATATCGTCC CTACCGTTTA GAAACTCCTA ACGGAGATTT AGCGATCGTT
TTTCGGGATC ATCGGTTGTC TGATTTAATA GGGTTTAGTT ATAGTGGGAT GGACTCTAGA
AGTGCGGCCT TAGATTTAGT GGCACATTTA GAAGGGATTA TCCGGTCTTT GGTCAATAAA
CAAGAAGATG GGGTAACATT ACAACAACCC TGGTTAGTGA CCATTGCTTT AGATGGGGAA
AATTGTTGGG AAAGTTATCA TCGAGATGGG TGGCCTTTCT TAGATACACT CTATCAAAAA
TTAAGCGACC ATCCAGAGAT TCAATTAGTG ACTGTTTCTG AGTTTCTCGA TAAATTCCCG
CCGACAGAAA CTATACCCGC CGAGAGTTTA CATAGTGGAT CTTGGGTAGA TGGAAGTTTC
ACCACTTGGA TAGGAGATCC GGTTAAAAAT AAAGCTTGGG ATCTCCTGAC GGATGCTAGA
GAAGTCTTAG CAAAACATCC AGAAGCAACC GAAGAAAATA ATCCGGATGC TTGGGAGGCG
TTATATGCGG CGGAAGGTTC AGATTGGTTT TGGTGGTTTG GGGAAGGTCA TTCTTCTAAT
CAGGATGCCC TATTCGATCA ATTGTTCCGT TCTCACCTAG CCGGCATTTA CAGCGCTCTT
AATGAACCGA TTCCCCCTAT TTTACATCAC CCGTTAGAAG ATCATCACAG AAAACGCCAA
CATCGCCCAG AAACCTTTAT TCATCCCATT ATTGACGGGT TTGGGGATGA GCAAGACTGG
GACAAAGCCG GTCGGATCGA AATGGGGGGA TCGAGTGGTA CAATGCACCG ACGGGAAGTC
GTGCAACGAT TGTTTTATGG TTGGGATCAC TTGAATTTTT ATCTCCGGTT GGATTTTCAG
CCCGGAGTTG TGCCGGGTCG GGATATTCCC CTAGAATTGC ATTTACTATG GTATTATCCG
GGGGCAGAGC GTCATAATTG TCCTGCACCT TTAGCAGATT TACCGGATCA AGCACCTTTA
AATTTTCATT TTCATCATCA TTTAGGAATT AATTTAATGA CTGAGTCGGT TTGGTTAGAA
GAAGCTGATG AAAAAGTCTT TTGGAAAGCG AGAAATACTC ATGCTATGGC GGCGTTTAAT
GAATGTTTAG AAATATCTGT TCCTTGGGAT GATTTAAGAC TTGAACCGGA TTGTCATTTG
CATTTGGTGG CAGTTTTAGC CGATCAAGGG CAATTTAAAA CTTATTTACC AGAAAATGAG
GTAGTTATGC TTCAAATGCC TTAA
 
Protein sequence
MSYPLYVAFI WHQHQPLYKS PEVASDASGQ YRLPWVRLHG TKDYLDLVLI LERYPKLHQT 
VNLVPSLILQ LEEYAAGTAL DPYIALTLTP EAQLSENQKK FIIEHFFDAN HHTLIDPHPR
YAQLYTQRQE EGRRWCLSNW TDQDYSDLLT WHNLAWIDPL FWDDPDIETW LRQGQNFTLG
DRQRLFTKQK EIISRIIPQH RKMQETGQLE VTTTPYTHPI LPLLADTNAG LVAVPNMRLP
ERRFQWEEDI PRHLRKAWDI YLDRFGREPR GLWPSEQSVS PAILPHISKA GFKWICSDEA
VLGNSLKTYF HRDETGTVED GELLYRPYRL ETPNGDLAIV FRDHRLSDLI GFSYSGMDSR
SAALDLVAHL EGIIRSLVNK QEDGVTLQQP WLVTIALDGE NCWESYHRDG WPFLDTLYQK
LSDHPEIQLV TVSEFLDKFP PTETIPAESL HSGSWVDGSF TTWIGDPVKN KAWDLLTDAR
EVLAKHPEAT EENNPDAWEA LYAAEGSDWF WWFGEGHSSN QDALFDQLFR SHLAGIYSAL
NEPIPPILHH PLEDHHRKRQ HRPETFIHPI IDGFGDEQDW DKAGRIEMGG SSGTMHRREV
VQRLFYGWDH LNFYLRLDFQ PGVVPGRDIP LELHLLWYYP GAERHNCPAP LADLPDQAPL
NFHFHHHLGI NLMTESVWLE EADEKVFWKA RNTHAMAAFN ECLEISVPWD DLRLEPDCHL
HLVAVLADQG QFKTYLPENE VVMLQMP
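
The sequence blocks above are printed in groups of ten characters, so they need to be cleaned before any computation. The following is a rough sketch of how that might be done and how the first line of the gene sequence relates to the first line of the protein sequence. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, used here on the assumption that translation table 11 assigns the same amino acids to codons and differs only in which start codons it permits.

```python
# Standard codon table built from the conventional TCAG ordering.
# Assumption: translation table 11 assigns the same amino acids to
# codons as this table and differs only in its allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a printed sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First printed line of the gene sequence, copied from the record.
first_line = "ATGTCTTACC CTTTATACGT TGCTTTTATT TGGCATCAAC ACCAACCTCT GTATAAGTCA"
dna = clean_sequence(first_line)

print(len(dna))        # 60
print(translate(dna))  # MSYPLYVAFIWHQHQPLYKS
```

The translated fragment matches the first twenty residues of the protein sequence. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the listed GC content.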