Gene PCC7424_3994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC7424_3994 
Symbol 
ID7107239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 7424 
KingdomBacteria 
Replicon accessionNC_011729 
Strand
Start bp4426373 
End bp4428577 
Gene Length2205 bp 
Protein Length734 aa 
Translation table11 
GC content41% 
IMG OID643482218 
ProductRDD domain containing protein 
Protein accessionYP_002379236 
Protein GI218440907 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.00412081 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAAGCC CTACGATTAA AAGAGGCAAA AATCGCTCAT CTGGTCAAAC TTTTTCGGTG 
ACAACAGTGT CCCTTCTTCC CCGACGCTGT GCCGCTTGGA TCATGGAAGT TTACCTGGTG
GCTATGAGTG GTATCGTTCC CTATAGCATT GGCGCTTATA TCGAATCTCA TTCCCACAGC
CAAAAAGTTC CTCTTCATCC GGTTTTAGCG TCTTTAGAAG AAGGAATTGC CCAAACTCTT
GCTCTCCCTC AGTCCCAAAC CCAACCGCGC CAAGTCCCCC CTCTAACTAA TCTATTTTGG
GGGTTAGCAT TAGGCACTCC GATCGCTGTA ATCGGATGGC AACTCTATAT TTTAGGGAGA
ACGGGAAGAA CGTTACCGAA ACGATGGTTA GGAGTCAAAG TCGTCACCAC TGGAGGGCGA
TCGCCTTCCG GGTTACAAAT TCTGCTACGA GAAGGGGTAG GGAAATGGGG TTTGCCTCTG
AGTGTCGCCT ATGTTCTTTG GCGGTATTCC CCCGTTTTTC CCAGTGGAGG AGGATTAATC
CTTTTCAGTG GGGTGATGGT TCTTTTAGAG GGAGGATTAT TGTTATTATC CCCTCGTCGT
CGTTGTTTTC ATGACCAAAT TGCCAACACA GTCGTGATAG ACACCAAACA GGGTTATAAA
AAGCGTTCCT CTCGTCCCTC ATCTCCCTCA GTCACCGTAG AAGTTCCCCT TAATTCCCCC
AATTATTCCC AAAAACAGCG CCGAAATCGA TCGCCTGAAC CGGTTAGAAC CCTTGTTTTA
AGACCGACAA TCGAAAAACA ATCCCCCAAT CTTTGGTTAT GGATGCGCCG CCATCCGGGG
ACAACTTTGT TAATCATTAG TTTGGCCGGA ATAACCTTAA TTTTGGCGAC TTTTGTAGGA
ACTCAAGTTT ATATTGAAAA GAAAGCGGAT CAACGACAAT CAGAACAACA AAAAAATCAA
GAATATCAGT TTTTAGTCAC TCAATTGGCG GCTACTTCTG CCGATCCTCT ACAAGAACGA
AAAGCCGTTA TTTTAGCTTT AGCCAGATTA GAAGATCCCC GTTCTATTAC TTTATTAGTC
GATTTGTTAG GACAGGAAAC TAATTTATCT ATTATTACAA CTTTAGAACA AGCGTTTGCC
AGTGTGGGGA CGAAAGCGTT ACCCGCCCTC AGACAACAAA ATCAATCTTT ATCTCAACAG
TTACAGGCGA TCGAAGATCA AACCTCTTCA GATTATCAGT TAACCGCTTG GCGTTTAAGG
GCAACTAAAC AAGCGATCGC TAAAATTTTA GTTCTGCATA ATAATCAACT GAGTCAGGTT
AATTTAAGTC GAGTTGATCT CAATAAAGAC ACCATAGAAA TAGCTCCCTT TACCTTAATA
GCGGAGGAAT TGGATTTATC CGGCATCAAT TTTGAGAATG CCCAATTATC ACAAGCTCGT
TTAAAGGGTA GCGTTTTTGC GAGTGCCGGT CAAGATAAAC ATGAGGATAC CTTTGATGAT
TTAATTGCCA ATTTTAAAGG GGCTAATTTA ACTCAAGCTG ACTTAAGTGA AGCGGTTTTA
CCTTGTGTTT CTCTAGTCGG GGCTAATTTA CAACAGACTA ATTTAAAAAG GTCTAATTTA
AAGCAAGCCA ACCTCCAAAA AGCCAATCTT AGCAGTGTCC AATTACTTCA AGCTAATTTG
CAGCAAGCTA ACTTAAAAGC TGCCAGTTTA ACCGGGGCTG ATTTAACTCA AGCTCAGTTT
AATCAAGCTA ATTTGGAACA GGCTAATTTA GGGCAACTTA ACGCAGTAGG AGCGAATTTT
TCGGAAGCTA ATTTAGCTAA ATCTAACTGG CAAGATTCGG ATTTATCAGG AGTTAATTTT
ACGTCGGCTA ATTTAGAACA AGCGGACTTG AGTTCTACGG TTTTAAAAGG AGTTAATTTC
CGTAACGCTC AATTAAATAA TGCCAATTTA ACCGATGCTA ATCTTTCTCA AGCTGACTTA
CGTTCGGCAA ATTTAGCGGG GGCAAATTTT CACGGTGTTA TTTTTTCTAA AGTTCAATTC
ACTAATACAA ACGGTTTTTT AAAAACACCC CCTAATCATC AATCTGAAGC CATTATAAAA
GGTGTAGATT TTAGTGAGGT CAAAAATTTA AGCCCAACTC AAATCGATTT TATTTGTCAA
AAAGGAGGCA TTCATCCTCA ATGTATAGTA GAAGGGGGAA GATAA
 
Protein sequence
MASPTIKRGK NRSSGQTFSV TTVSLLPRRC AAWIMEVYLV AMSGIVPYSI GAYIESHSHS 
QKVPLHPVLA SLEEGIAQTL ALPQSQTQPR QVPPLTNLFW GLALGTPIAV IGWQLYILGR
TGRTLPKRWL GVKVVTTGGR SPSGLQILLR EGVGKWGLPL SVAYVLWRYS PVFPSGGGLI
LFSGVMVLLE GGLLLLSPRR RCFHDQIANT VVIDTKQGYK KRSSRPSSPS VTVEVPLNSP
NYSQKQRRNR SPEPVRTLVL RPTIEKQSPN LWLWMRRHPG TTLLIISLAG ITLILATFVG
TQVYIEKKAD QRQSEQQKNQ EYQFLVTQLA ATSADPLQER KAVILALARL EDPRSITLLV
DLLGQETNLS IITTLEQAFA SVGTKALPAL RQQNQSLSQQ LQAIEDQTSS DYQLTAWRLR
ATKQAIAKIL VLHNNQLSQV NLSRVDLNKD TIEIAPFTLI AEELDLSGIN FENAQLSQAR
LKGSVFASAG QDKHEDTFDD LIANFKGANL TQADLSEAVL PCVSLVGANL QQTNLKRSNL
KQANLQKANL SSVQLLQANL QQANLKAASL TGADLTQAQF NQANLEQANL GQLNAVGANF
SEANLAKSNW QDSDLSGVNF TSANLEQADL SSTVLKGVNF RNAQLNNANL TDANLSQADL
RSANLAGANF HGVIFSKVQF TNTNGFLKTP PNHQSEAIIK GVDFSEVKNL SPTQIDFICQ
KGGIHPQCIV EGGR