Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC7424_4937 |
Symbol | |
ID | 7107003 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 7424 |
Kingdom | Bacteria |
Replicon accession | NC_011729 |
Strand | - |
Start bp | 5476693 |
End bp | 5479851 |
Gene Length | 3159 bp |
Protein Length | 1052 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643483149 |
Product | hypothetical protein |
Protein accession | YP_002380159 |
Protein GI | 218441830 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.24473 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACACCG AGCCATCAAC CGTCCTAAAA ACCGCCTGGC AACGCTACGC CCAACTTAAA ACCAATGCGG CGATCGCCTC AAAACAATAT CTCAGCCTAC GGCTTTGGGT GATGGCTAGT GCTTTACTCA CTATAGGGTT AGCCATCCTC GTGTGTATCA ATGATTGGTC TTTAGTTGCC TCTCCCCTGA CAGAAGCCCT CGAAGGCAGT TTAATTATCA TTCCCATCGC CACTTTAGTG ATTTGGTCTT TTACTAACAG ATTACGACAA GGGCAATATT GGCAAGTGTT TAACACCAGT GCCGCCGCCA TTATTAATGC TATCTATCTC TATCGAACCC TCTTGATCGG ACAAAGCGAT CGTCATTTAT GGCTTCAAGA TCGAATCACC CTTATTTACG ATCAAGTCAT GGAACAGATA GGAGGAAATT TAGTTTTAAA ACCCTATCAA GACTCTCTCC CTCCTAGCGA CGCTCCAGAA ATTGAACAGG GAGACTCAGG ATGGAATGAT TTATCCGTAG AAGATTATAT TAGTTATCGT TTAGACGCTC AATTAAACTG GCACGATCGA GCGGTTAAAA AGTTAGATAT CACCCGGAAT AATCTTCGCC TGGCCATAGT TTTTTGTTTA GGATTAAGCC CTCTTTTCCC GATTTTAGGG GGTAAATATC TCCTGTGGAT TGCCTTAACA ACCACCTCGG CAATTATTTT AATGAGTTTA TTAGAAGTTA ATCGGATCGA TGATTACATT CATCAATATA ATCAATTAGT TTGGCAATTA AGAGCGATTC GGGATACCTG GCAAACTCTC GAAAAACAAA AATGTACCAC TGAACATATT GTTAGATTAA TCCTCTCAAC AGAAACCTTG TTAGGCAATG TTGCCACCTC GATGACTCAC TCTCAAAGCC ATAGTTTAAT TACAATTAGC GAAAAAAATA CCCCCGATTT ATTAACTCAA GTCGTCAATA CTCTTCCTTT AACATTAGTC AGTCTTTGTG CCCTTCCCCA AGCTTCCGAC AGGCAAAATC TACTCACCTC AGCCGGAGAA GTGATTAATA GTGCTGGCAA AGATATTCTG AAAAAAGAAC CCGAAATAGA ACTGGTCAAA AAAGAAATTC TTCATGCTTT TATCGTGATG CCTTTTGGAC GAAAACAAGG AAAAGATGGG CGTTGGATAG ATTTTGATAC TATTTATTAC ACTCTCATTA AACCGGCGGT AGAAGCGGCA GGATTTGAAT CATTTCGTGC TGATGAAGAA ACCGTCACCG GAGATATTTT AACCGATATG TTACAAGAAT TATTATTAGC TGATCTGGTG ATAGCGGATT TGAGTATTGA TAACGCCAAT GTTTTTTATG AAATTGGGAT TCGTCACGCC CTCCGCAAAC GAGGCGTGGT TCATATTCAA TGTGGACGAG CTTATTTACC CTATGATATT TTTAATGTTC GGACTTTAGC TTATCATTGT AATGAAAATG GTCAGCCGGA TTTAGATTAT TTAGAAAAAG ATAAACAGGC ACTGACTAAC ATGATTCAAA AAACCTGGGA ATCAGAACCC ACTCGGATTC ATAGTCCTTT ATTTAATTTA TTAACAGGTT TACCCGAACC CAATCGGAAA TTATTACAAA CTCCTTTAGC AACCGGGTAT TGGCAAGAAT ATACAGAATG GCAAAAACGA GTTACCATTG CTCAACAGAA AAAACAAATC GGTGATGTAC TGTTATTAAC CGAAGAAGTT AAAAATCCTT TTATTAAATG CGAAGTAATC GCGGAATCCG GCCAAGTCTT AAAACGATTA AATAATCATG CTTTAGCTCT AAAAGAATAT CGACAAGGAT TAAAAATAGA TCCTAAAAAT CCTATATTTC GCTTAGAAGA AGCTTATCAT CTCAGTCGCT TAAATCAATA CGATGAAGCC ATTGTTAAAT TAGAATGGTT ACTGCAAGAT GACCCTAAAA ATATGGATGC TCTTTCTTAT TTAGGACGCA TTTATAAAGA TGTTTGGCGA GAAGAATGGG AACATATCAC TGATGAACAA GAACGTCTCA AACAAGCCTA TGAATCAGCT TATTTACTAG AAAGATCGAT AGAAACTTAT CTCCAAGCCT ATCAACTCAA TCAAAATCAT TATTATTCAG GGATTAATGC TGTTACTTTA TTATTTATTT TAGATTACTT AGCTCAACAA TATAGCCAAG AAAATGAGCC AGATTATAAA ATTTTAAGAG AGCAATTGCC GTCTTTATGT GGAGCGATCG AATTTTGTCT TAATAGTCAT GCTAAAATAA CTCCGACCGA TTTTTGGGTG TTTTTATCTC TCGGTGATTT AGCCGTTTGT AATGCGTCCT CTCCTAAAGA AGTGACCCGG AAGTATAAAA AAGCCTTAAC CTTACTTTGG AATAATAAAT TTGCTCTACA ATCAACCCTA ACTCAACTCA AATTATTAGA AACCTTAAAT TTTCGTCTGG ACTATGTTCA GGCAGGAATA ACCTTATTAA AAGCCGAATA TGAAAGAATA GAAAAACAAG AGAAAACCGT ATCTCTACAA ACAGAGACAG ACCCCTTACA GGTGTTTTTA TTTTCGGGTC ATATGATCGA TAATCCAGAA CGAACAAAAC CCCGATTTCC CGCCGACATG GAAACCGAAG TTCAGAGTAA AATTAAGGCA GTCTTAAAAG AATTAAATGC CAATGAAAAT GATTTATGTA TTACGGCGGG GATTGCTTGT GGAGGAGATA TTATTTTTAT AGAAATTTGT TTACAACTTA ACATGACAGT TGAAATTTAT CTGCCCTTTC CTCTAGAAGA ATTTATCCAA CAATCCGTCA GTTTTGCTGG AGATGATTGG GTCGAACGAG TTTATAAAAT TAAAAATCAT CCGAATGTGA CTTTTCATTT TCAACCTGAA CGACTAGGAG CATTACCAAA GGGAGATAAT CCCTTTTCTC GTAATAATCG TTGGGCATTT TATTCAACTT TAATGTATGG AATCGATAAA GTGAGGCTCA TTGTTTTATG GGATGGTAAA GGAGGAGATG GCCCCGGCGG AACTCAAGAT ATGGTTAACC AAGTTCGTCA ATTTGGGGGT ATTGTGGAAC ATTTAGACAC CACTAAGTTT GATTATTGGG ACAAAGTTAA ATTATTTCAT CGGGAATAG
|
Protein sequence | MNTEPSTVLK TAWQRYAQLK TNAAIASKQY LSLRLWVMAS ALLTIGLAIL VCINDWSLVA SPLTEALEGS LIIIPIATLV IWSFTNRLRQ GQYWQVFNTS AAAIINAIYL YRTLLIGQSD RHLWLQDRIT LIYDQVMEQI GGNLVLKPYQ DSLPPSDAPE IEQGDSGWND LSVEDYISYR LDAQLNWHDR AVKKLDITRN NLRLAIVFCL GLSPLFPILG GKYLLWIALT TTSAIILMSL LEVNRIDDYI HQYNQLVWQL RAIRDTWQTL EKQKCTTEHI VRLILSTETL LGNVATSMTH SQSHSLITIS EKNTPDLLTQ VVNTLPLTLV SLCALPQASD RQNLLTSAGE VINSAGKDIL KKEPEIELVK KEILHAFIVM PFGRKQGKDG RWIDFDTIYY TLIKPAVEAA GFESFRADEE TVTGDILTDM LQELLLADLV IADLSIDNAN VFYEIGIRHA LRKRGVVHIQ CGRAYLPYDI FNVRTLAYHC NENGQPDLDY LEKDKQALTN MIQKTWESEP TRIHSPLFNL LTGLPEPNRK LLQTPLATGY WQEYTEWQKR VTIAQQKKQI GDVLLLTEEV KNPFIKCEVI AESGQVLKRL NNHALALKEY RQGLKIDPKN PIFRLEEAYH LSRLNQYDEA IVKLEWLLQD DPKNMDALSY LGRIYKDVWR EEWEHITDEQ ERLKQAYESA YLLERSIETY LQAYQLNQNH YYSGINAVTL LFILDYLAQQ YSQENEPDYK ILREQLPSLC GAIEFCLNSH AKITPTDFWV FLSLGDLAVC NASSPKEVTR KYKKALTLLW NNKFALQSTL TQLKLLETLN FRLDYVQAGI TLLKAEYERI EKQEKTVSLQ TETDPLQVFL FSGHMIDNPE RTKPRFPADM ETEVQSKIKA VLKELNANEN DLCITAGIAC GGDIIFIEIC LQLNMTVEIY LPFPLEEFIQ QSVSFAGDDW VERVYKIKNH PNVTFHFQPE RLGALPKGDN PFSRNNRWAF YSTLMYGIDK VRLIVLWDGK GGDGPGGTQD MVNQVRQFGG IVEHLDTTKF DYWDKVKLFH RE
|
| |