Gene Information |
Locus tag | Synpcc7942_0731 |
Symbol | |
ID | 3775902 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007604 |
Strand | + |
Start bp | 725474 |
End bp | 727354 |
Gene Length | 1881 bp |
Protein Length | 626 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637799144 |
Product | putative phage terminase large subunit |
Protein accession | YP_399750 |
Protein GI | 81299542 |
COG category | [R] General function prediction only |
COG ID | [COG5525] Bacteriophage tail assembly protein |
TIGRFAM ID | |
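The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based and inclusive and that the CDS ends in a single stop codon; neither is stated in the record, although both agree with the listed values. The function name `check_lengths` is hypothetical.

```python
def check_lengths(start_bp: int, end_bp: int, gene_length_bp: int, protein_length_aa: int) -> bool:
    """Return True if the coordinates, gene length and protein length agree.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the CDS carries exactly one stop codon.
    """
    # Inclusive coordinates: the span covers both end points.
    span = end_bp - start_bp + 1
    # One codon (3 bp) per residue, plus one stop codon.
    expected_cds = (protein_length_aa + 1) * 3
    return span == gene_length_bp and expected_cds == gene_length_bp


# Values copied from the Gene Information fields above.
print(check_lengths(725474, 727354, 1881, 626))  # True
```

For this record, 727354 - 725474 + 1 = 1881 bp, and (626 + 1) × 3 = 1881 bp, so the three fields are mutually consistent under those assumptions.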
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGATCG CTACTGCAAC TGACATTTGC CTCCAAGGTT TCTTGGAGGG ACTGAGGCCT GACCCGATTT TGTCGGTGAG TGACTGGGCT GATCAGTTCC GGTTCCTGTC GCAGCGTGCC TCGGCTGAGC CTGGTCGCTG GCGCACGAGC CGGACTCCGT ATCTGGCGGA GATTATGGAC AACCTTGGGG CTACCAGCTC TGTCGAAAAG ATTGTGTTTG TCAAAGGCTC ACAAATCGGT GGATCTGAGG CGGGCTGTAA CTGGATTGGC TACACCATCG ACTTGACGCC CGCGCCGATG CTGGTGATTC AGCCGACGGT GGACATCGCT AAACGGTTCA GCCGGCAACG GCTGCAGCCG TTGATTGATG AGACGCCACG CCTGAAAGCA AAGGTTCGGC CAGCTAGGGA GCGGGACAGT GGCAACAGTG TGCTGTCGAA GGAGTTCCCG GGGGGAATCG TGTTGCTAGC GGGGGCAAAC AGCGCAGCAG GCCTGCGGTC GATGCCGATC GCGCGGTTGT TTGCGGATGA GGTGGACGCC TATCCGGGCG ATGTGGACGG GGAGGGCGAC CCAATTGGGC TGGCTGAGGC GCGGACGCGG ACGTTTAGCC GCCGCAAGAT TCTGCTGGTG AGTACGCCGA CGCGGGCTGG GGTCAGCCGA ATTTGGCGGG AGTGGGAGCT GTCGGATCAG CGTCGCTATC AGGTGCCCTG CCCGCACTGC GGTGAGTACC AGGCGATCGC TTGGGACCGG ATTCACTACG ACCCGAACGA CCTCAGCATC GCGCCAGTGC TGATGTGCGA GCACTGCGGA ACGGGCATTG AGGAACGGCA CAAGCCGAAG ATGCTGGCCG CTGGTCGCTG GGTGCCGGAG AGCCCTGACA GCGCGGTGCG TGGCTACCAC TTGAGCAGCC TGTATTCGCC ACTGGGTTGG TTCAGTTGGC GCGATGCGGT GCGGATGTGG GTCAACGCAA AGACTGACGA ACAGCAGCGA GTGTTTGTCA ACACGGTTCT GGGCGAGTGC TGGGCTGAGG CTGGCGAAGC GCCTGACTGG GAGCAGCTGT ACCGACGACG CGAGGACTAC GCGATCGCAA CGGTGCCGCC AGGCGGCTTG GTGCTGACGG CGGGTGTGGA CGTGCAAAGC GATCGCATCG AGCTGGAAGT GGTGGCCTGG GGGCCAAACC TTGAGAGCTG GTCGGTTGAG TACGTCGTCA TCCAGGGCGA CACGGCGACG GCTGCCCCGT GGGCTGAACT GGAGAAGCAG TTGGCGCGCA CCTATCCACG GATAGGTGGG GGTGAGCTAC CGATTGGCAA GGTTTGTGTA GACACGGGCT ACAACACGAT GGAGGTCTAC GCCTGGTGCC GGAAGCAGTC GGTGAGTCGA GTGATTCCAA TCAAGGGACG TGACACGCTG ACGACGATTT TGGGCACGCC GAAACTGCAG GATGTGAGCC TGAAGGGCAA GACGATCAAG GGCGGCATCC GGCTCTGGCC AGTGGGTGTG AGTGTGGCGA AGAGCGAGCT GTATGGCTGG CTGCGACTAG ACCCGCCACT GAATGACTGC GACCCGTACC CGAGGGGGTT CATGCACTTC CCGCAGTACG GCGAGGAGTA TTTCGCGCAG CTGACGGCTG AGGAGCTGCA ATTCAGGTTG GTGAAAGGCT TTAAGCGCTA CGAGTGGGTG AAGACAAGGC CGCGCAACGA GGCTCTGGAC TGCCGGGTCT ATGCGAGAGC AGCGGCGACG GCGATGGGGC TGGACCGCTG GCCTGCGGAG CGCTGGCAGG AGATCGCGCG ATCGCTGATG 
GATGCGACAC ACCTGCAGCA ACCCAGCAAG CCAGCTACCA AGGTTCAACG CAAGCGGCGA TCGGGCTGGC TGAGTGAGTA G
|
Protein sequence | MAIATATDIC LQGFLEGLRP DPILSVSDWA DQFRFLSQRA SAEPGRWRTS RTPYLAEIMD NLGATSSVEK IVFVKGSQIG GSEAGCNWIG YTIDLTPAPM LVIQPTVDIA KRFSRQRLQP LIDETPRLKA KVRPARERDS GNSVLSKEFP GGIVLLAGAN SAAGLRSMPI ARLFADEVDA YPGDVDGEGD PIGLAEARTR TFSRRKILLV STPTRAGVSR IWREWELSDQ RRYQVPCPHC GEYQAIAWDR IHYDPNDLSI APVLMCEHCG TGIEERHKPK MLAAGRWVPE SPDSAVRGYH LSSLYSPLGW FSWRDAVRMW VNAKTDEQQR VFVNTVLGEC WAEAGEAPDW EQLYRRREDY AIATVPPGGL VLTAGVDVQS DRIELEVVAW GPNLESWSVE YVVIQGDTAT AAPWAELEKQ LARTYPRIGG GELPIGKVCV DTGYNTMEVY AWCRKQSVSR VIPIKGRDTL TTILGTPKLQ DVSLKGKTIK GGIRLWPVGV SVAKSELYGW LRLDPPLNDC DPYPRGFMHF PQYGEEYFAQ LTAEELQFRL VKGFKRYEWV KTRPRNEALD CRVYARAAAT AMGLDRWPAE RWQEIARSLM DATHLQQPSK PATKVQRKRR SGWLSE