Gene PCC7424_1572 details

Gene Information

Locus tag: PCC7424_1572
Symbol:
ID: 7109572
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 7424
Kingdom: Bacteria
Replicon accession: NC_011729
Strand:
Start bp: 1733254
End bp: 1734780
Gene Length: 1527 bp
Protein Length: 508 aa
Translation table: 11
GC content: 49%
IMG OID: 643479838
Product: photosystem II chlorophyll-binding protein CP47
Protein accession: YP_002376879
Protein GI: 218438550
COG category:
COG ID:
TIGRFAM ID: [TIGR03039] photosystem II chlorophyll-binding protein CP47
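
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the unspliced CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1733254
end_bp = 1734780

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose final codon is a stop codon,
# so the protein has one residue fewer than the codon count.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1527 508"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.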


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value: 0.287144
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGACTAC CTTGGTATCG AGTCCACACA GTGGTTCTGA ATGACCCAGG ACGTTTGATC 
TCTGTTCACC TGATGCACAC CGCTCTGGTA GCGGGTTGGG CAGGTTCTAT GGCTCTGTAT
GAGCTAGCAC TATTTGATCC CAGCGATCCC GTTCTTAACC CGATGTGGCG ACAAGGGATG
TTCGTACTTC CCTTTATGGC GCGTTTGGGA GTAACTGGAT CTTGGGGTGG CTGGAGCGTC
ACCGGTGAAA CTGGTGTAAA CCCTGGTTTC TGGTCTTTTG AAGGGGTTGC TGCGGCTCAC
ATCGTTTTAT CTGGGTTGTT ATTCTTAGCA GCCGTTTGGC ACTGGGTTTA CTGGGATCTA
GAACTCTTCG TTGATCCTCG TACTGGTGAA CCCGCTCTTG ACTTACCGAA GATGTTCGGT
ATTCACTTGT TCTTATCTGG TCTTCTTTGC TTTGGGTTTG GTGCTTTCCA CCTGACAGGG
TTATGGGGCC CAGGAATGTG GGTGTCGGAT GCCTACGGGT TGACGGGTCA CGTTCAACCG
GTTGCCCCAG AATGGGGGCC AGCCGGGTTT AACCCCTTTA ACCCCGGTGG AGTTGTGGCA
CACCACATTG CAGCCGGTAT TGTTGGTATT ATTGCTGGTC TATTCCATTT GACGGTTCGT
CCCCCAGAGC GTCTCTATAA AGCTCTGAGA ATGGGGAATA TTGAAACCGT TCTCTCTAGC
AGTATTGCTG CGGTATTCTT TGCTGCTTTT GTGGTAGCGG GAACAATGTG GTACGGAAAC
GCCACCACCC CGATCGAATT ATTTGGCCCT ACCCGTTATC AATGGGATCA AGGCTACTTT
AAGCAAGAAA TTCAGCGCCG AGTACAAACC AGTCTGGCTC AGGGAGATAG CTTGTCTGAA
GCTTGGTCAA AAATTCCTGA AAAGTTAGCT TTCTATGATT ATGTGGGTAA CAGTCCGGCA
AAAGGCGGTT TATTCCGTAC TGGTGCTATG GATAGCGGAG ACGGTATCGC TCAAGCTTGG
CTTGGACATC CCGTATTTAC CGATAAAGAT GGTCGGGTAT TAACCGTCCG TCGGATGCCT
AACTTCTTTG AAACTTTCCC CGTTGTTTTA GCTGATGCTG AAGGGGTAAT TCGCGCTGAT
ATTCCTTTCC GTCGTGCTGA GTCTAAACTC TCTGTTGAGC AAACTGGAGT AACCGTTAGC
TTCTACGGTG GTGCATTAGA TGGACAAACC TTTGACAACC CTGCTGATGT TAAAGTGTTT
GCTCGTAAGG CTCAATTAGG TGAACCCTTC GACTTTGACC GGGAAACCTT AAACTCTGAT
GGGGTATTCC GTACTTCTCC CAGAGGTTGG TTTACTTTTG GTCATGCTTG TTTTGCCCTT
CTGTTCTTCT TCGGTCATAT TTGGCATGGT TCTCGTACTC TGTTCCGAGA TGTATTTGCT
GGTATTGATC CGGATCTTGA GGAACAAGTT GAATTCGGTG TGTTTGCTAA AGTGGGTGAC
TTGAGTACCC GTAAAGAAAC CGCCTAG
 
Protein sequence
MGLPWYRVHT VVLNDPGRLI SVHLMHTALV AGWAGSMALY ELALFDPSDP VLNPMWRQGM 
FVLPFMARLG VTGSWGGWSV TGETGVNPGF WSFEGVAAAH IVLSGLLFLA AVWHWVYWDL
ELFVDPRTGE PALDLPKMFG IHLFLSGLLC FGFGAFHLTG LWGPGMWVSD AYGLTGHVQP
VAPEWGPAGF NPFNPGGVVA HHIAAGIVGI IAGLFHLTVR PPERLYKALR MGNIETVLSS
SIAAVFFAAF VVAGTMWYGN ATTPIELFGP TRYQWDQGYF KQEIQRRVQT SLAQGDSLSE
AWSKIPEKLA FYDYVGNSPA KGGLFRTGAM DSGDGIAQAW LGHPVFTDKD GRVLTVRRMP
NFFETFPVVL ADAEGVIRAD IPFRRAESKL SVEQTGVTVS FYGGALDGQT FDNPADVKVF
ARKAQLGEPF DFDRETLNSD GVFRTSPRGW FTFGHACFAL LFFFGHIWHG SRTLFRDVFA
GIDPDLEEQV EFGVFAKVGD LSTRKETA