Gene Synpcc7942_0731 details


Gene Information

Locus tag: Synpcc7942_0731
Symbol:
ID: 3775902
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus elongatus PCC 7942
Kingdom: Bacteria
Replicon accession: NC_007604
Strand:
Start bp: 725474
End bp: 727354
Gene Length: 1881 bp
Protein Length: 626 aa
Translation table: 11
GC content: 62%
IMG OID: 637799144
Product: putative phage terminase large subunit
Protein accession: YP_399750
Protein GI: 81299542
COG category: [R] General function prediction only
COG ID: [COG5525] Bacteriophage tail assembly protein
TIGRFAM ID:
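
The start, end, and length fields above are mutually consistent if the coordinates are read as 1-based and inclusive, and if the final codon of the CDS is a stop codon that is not counted in the protein length. Both are assumptions made for this sketch, not statements from the record itself, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(725474, 727354)
print(length)                     # 1881
print(protein_length_aa(length))  # 626
```

With the values from this record, the sketch reproduces the listed Gene Length (1881 bp) and Protein Length (626 aa).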


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGATCG CTACTGCAAC TGACATTTGC CTCCAAGGTT TCTTGGAGGG ACTGAGGCCT 
GACCCGATTT TGTCGGTGAG TGACTGGGCT GATCAGTTCC GGTTCCTGTC GCAGCGTGCC
TCGGCTGAGC CTGGTCGCTG GCGCACGAGC CGGACTCCGT ATCTGGCGGA GATTATGGAC
AACCTTGGGG CTACCAGCTC TGTCGAAAAG ATTGTGTTTG TCAAAGGCTC ACAAATCGGT
GGATCTGAGG CGGGCTGTAA CTGGATTGGC TACACCATCG ACTTGACGCC CGCGCCGATG
CTGGTGATTC AGCCGACGGT GGACATCGCT AAACGGTTCA GCCGGCAACG GCTGCAGCCG
TTGATTGATG AGACGCCACG CCTGAAAGCA AAGGTTCGGC CAGCTAGGGA GCGGGACAGT
GGCAACAGTG TGCTGTCGAA GGAGTTCCCG GGGGGAATCG TGTTGCTAGC GGGGGCAAAC
AGCGCAGCAG GCCTGCGGTC GATGCCGATC GCGCGGTTGT TTGCGGATGA GGTGGACGCC
TATCCGGGCG ATGTGGACGG GGAGGGCGAC CCAATTGGGC TGGCTGAGGC GCGGACGCGG
ACGTTTAGCC GCCGCAAGAT TCTGCTGGTG AGTACGCCGA CGCGGGCTGG GGTCAGCCGA
ATTTGGCGGG AGTGGGAGCT GTCGGATCAG CGTCGCTATC AGGTGCCCTG CCCGCACTGC
GGTGAGTACC AGGCGATCGC TTGGGACCGG ATTCACTACG ACCCGAACGA CCTCAGCATC
GCGCCAGTGC TGATGTGCGA GCACTGCGGA ACGGGCATTG AGGAACGGCA CAAGCCGAAG
ATGCTGGCCG CTGGTCGCTG GGTGCCGGAG AGCCCTGACA GCGCGGTGCG TGGCTACCAC
TTGAGCAGCC TGTATTCGCC ACTGGGTTGG TTCAGTTGGC GCGATGCGGT GCGGATGTGG
GTCAACGCAA AGACTGACGA ACAGCAGCGA GTGTTTGTCA ACACGGTTCT GGGCGAGTGC
TGGGCTGAGG CTGGCGAAGC GCCTGACTGG GAGCAGCTGT ACCGACGACG CGAGGACTAC
GCGATCGCAA CGGTGCCGCC AGGCGGCTTG GTGCTGACGG CGGGTGTGGA CGTGCAAAGC
GATCGCATCG AGCTGGAAGT GGTGGCCTGG GGGCCAAACC TTGAGAGCTG GTCGGTTGAG
TACGTCGTCA TCCAGGGCGA CACGGCGACG GCTGCCCCGT GGGCTGAACT GGAGAAGCAG
TTGGCGCGCA CCTATCCACG GATAGGTGGG GGTGAGCTAC CGATTGGCAA GGTTTGTGTA
GACACGGGCT ACAACACGAT GGAGGTCTAC GCCTGGTGCC GGAAGCAGTC GGTGAGTCGA
GTGATTCCAA TCAAGGGACG TGACACGCTG ACGACGATTT TGGGCACGCC GAAACTGCAG
GATGTGAGCC TGAAGGGCAA GACGATCAAG GGCGGCATCC GGCTCTGGCC AGTGGGTGTG
AGTGTGGCGA AGAGCGAGCT GTATGGCTGG CTGCGACTAG ACCCGCCACT GAATGACTGC
GACCCGTACC CGAGGGGGTT CATGCACTTC CCGCAGTACG GCGAGGAGTA TTTCGCGCAG
CTGACGGCTG AGGAGCTGCA ATTCAGGTTG GTGAAAGGCT TTAAGCGCTA CGAGTGGGTG
AAGACAAGGC CGCGCAACGA GGCTCTGGAC TGCCGGGTCT ATGCGAGAGC AGCGGCGACG
GCGATGGGGC TGGACCGCTG GCCTGCGGAG CGCTGGCAGG AGATCGCGCG ATCGCTGATG
GATGCGACAC ACCTGCAGCA ACCCAGCAAG CCAGCTACCA AGGTTCAACG CAAGCGGCGA
TCGGGCTGGC TGAGTGAGTA G
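
The GC content field can be recomputed from a sequence printed in this layout. The following is a minimal sketch, assuming that spaces and line breaks are layout only and that GC content means the fraction of G and C bases over the full sequence length. The function names are hypothetical, and the example uses only the first two 10-base blocks of the gene sequence rather than the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Drop the spaces and line breaks used to lay out the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


first_blocks = clean_sequence("ATGGCGATCG CTACTGCAAC")
print(len(first_blocks))          # 20
print(gc_fraction(first_blocks))  # 0.55
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` gives a value that can be compared against the GC content listed in the record.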
 
Protein sequence
MAIATATDIC LQGFLEGLRP DPILSVSDWA DQFRFLSQRA SAEPGRWRTS RTPYLAEIMD 
NLGATSSVEK IVFVKGSQIG GSEAGCNWIG YTIDLTPAPM LVIQPTVDIA KRFSRQRLQP
LIDETPRLKA KVRPARERDS GNSVLSKEFP GGIVLLAGAN SAAGLRSMPI ARLFADEVDA
YPGDVDGEGD PIGLAEARTR TFSRRKILLV STPTRAGVSR IWREWELSDQ RRYQVPCPHC
GEYQAIAWDR IHYDPNDLSI APVLMCEHCG TGIEERHKPK MLAAGRWVPE SPDSAVRGYH
LSSLYSPLGW FSWRDAVRMW VNAKTDEQQR VFVNTVLGEC WAEAGEAPDW EQLYRRREDY
AIATVPPGGL VLTAGVDVQS DRIELEVVAW GPNLESWSVE YVVIQGDTAT AAPWAELEKQ
LARTYPRIGG GELPIGKVCV DTGYNTMEVY AWCRKQSVSR VIPIKGRDTL TTILGTPKLQ
DVSLKGKTIK GGIRLWPVGV SVAKSELYGW LRLDPPLNDC DPYPRGFMHF PQYGEEYFAQ
LTAEELQFRL VKGFKRYEWV KTRPRNEALD CRVYARAAAT AMGLDRWPAE RWQEIARSLM
DATHLQQPSK PATKVQRKRR SGWLSE
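
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which codons may act as starts), and it does not model alternative start codons. That simplification is harmless here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # remove layout whitespace
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence in this record.
print(translate("ATGGCGATCG CTACTGCAAC TGACATTTGC"))  # MAIATATDIC
print(translate("TCGGGCTGGC TGAGTGAGTA G"))           # SGWLSE
```

The two outputs match the first ten and last six residues of the protein sequence listed above.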