Gene Synpcc7942_1791 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_1791 
Symbol 
ID3774366 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp1861831 
End bp1863066 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content55% 
IMG OID637800232 
Producthypothetical protein 
Protein accessionYP_400808 
Protein GI81300600 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.00386684 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGACCTTCT CCACCGCTGC TACCGCCCTG ATTGGACTGG CGATCGCGAC TGGATGTGCC 
AGTTGCAGCT CGGCCTCCTT GCCACCAACT GCTTCAGCCC CTGTGAAGAC GGCAGCGAAG
CCCACGGCTT TATTGGTGCC TGAGTCGGTG CCCGAGCTGA CTACGGTGGA AGTAACGCCC
CTGGTCGAAG GGCTGGAACA TCCCTGGAGT CTGGCTTGGC TTCCCAACGG CGATCTGTTG
ATTACGGAAC GGCCGGGACG GCTGCGATTA GTTCGCCAAG GAAAGCTGGA ACCGACTGCG
ATCGCGGGGT TACCCGCCGA CCTGTTTGCA CAAGGTCAGG GTGGCTTACT AGATATTGCT
GTTGATCCTC AGTTTGAGCA AAATCGCTGG GTCTATTTCA GCTACGCTGC TGGCACCGAA
GCAGTTAATC GGGTTCAAGT GGCGCGGGGT AAGCTGAATG GCCTGCGACT GGAGAATGTA
GAAGTCATCT TCACCGTGCG GCCCGATAAA TCCAGTGCCC AGCATTTTGG CTCGCGGCTG
GCTTGGTTGC CCGATCGCAC CTTATTAATT GCGATCGGCG ATGGCGGTAA TCCGCCCGTG
GAACTCAATG GTCGTTTGAT TCGCCATCAA GCCCAGATGC CCGAGAGCGG TTTGGGTAAA
ATTCATCGCA TCAATCGGGA TGGCTCTATT CCTGCTGATA ATCCGTTTCG GAATCAGCCC
AAGGCTCAAG CCAGCTTATG GAGCCTTGGC CATCGCAATA TTCAGGGCTT AGCAGTTGAT
CCGAAGACTG GAACAGTGTG GTCAACCGAG CATGGATCGC GGGGTGGTGA TGAGTTGAAT
CAAATTAAAG CAGGCGAAAA TTACGGCTGG CCTGAGGTTA CCTTTAGCCA AGAATATTGG
GGCGCTGAAA TTACTCCGCT TCGGACTCAA GCTGGCATGA TTGACCCGCA TTTAGTCTGG
ACGCCAGCGA TCGCTCCTTC TGGGATAACG GTCTATCGTG GTACAAAAGT GCCTGATTGG
CAGGGCAAGA TTTTTGCAGG CGGTTTAGTT GGCCGAGATA TTCGCGTCAT CCAGCTGTCT
CCCGAAGGCC AAGCAACTAA TGTTTCGCGT ATTCCCATCG GAGCGCGAGT TCGGGACGTT
CGCCAAGGCC CCAGCGGCGA TCTCTATGTA TTGACCGATG AGTCTTCGGG CAAGCTCATT
CGTGTGCGAT CGACGACTGC TTCAACTCAA AGCTAA
 
Protein sequence
MTFSTAATAL IGLAIATGCA SCSSASLPPT ASAPVKTAAK PTALLVPESV PELTTVEVTP 
LVEGLEHPWS LAWLPNGDLL ITERPGRLRL VRQGKLEPTA IAGLPADLFA QGQGGLLDIA
VDPQFEQNRW VYFSYAAGTE AVNRVQVARG KLNGLRLENV EVIFTVRPDK SSAQHFGSRL
AWLPDRTLLI AIGDGGNPPV ELNGRLIRHQ AQMPESGLGK IHRINRDGSI PADNPFRNQP
KAQASLWSLG HRNIQGLAVD PKTGTVWSTE HGSRGGDELN QIKAGENYGW PEVTFSQEYW
GAEITPLRTQ AGMIDPHLVW TPAIAPSGIT VYRGTKVPDW QGKIFAGGLV GRDIRVIQLS
PEGQATNVSR IPIGARVRDV RQGPSGDLYV LTDESSGKLI RVRSTTASTQ S