Gene Synpcc7942_2402 details

Gene Information

Locus tag: Synpcc7942_2402
Symbol:
ID: 3774686
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus elongatus PCC 7942
Kingdom: Bacteria
Replicon accession: NC_007604
Strand:
Start bp: 2476025
End bp: 2477155
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 58%
IMG OID: 637800850
Product: hypothetical protein
Protein accession: YP_401419
Protein GI: 81301211
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2957] Peptidylarginine deiminase and related enzymes
TIGRFAM ID:
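
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: total codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2476025, 2477155)
print(length, protein_length_aa(length))  # 1131 376
```

Under these assumptions the computed values agree with the listed gene length (1131 bp) and protein length (376 aa).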


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.00162245
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.000927611
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAGGGGGG ATTTCGCACC GAACGACCGT ACCCAAACAA CCGAAGCCGC TAGGCTGAAC 
AAGGGTCTTT TTGCTAGTCA AATCATGAGT CAGGGGCAAC AGAGCGATCG CCGTTTTCCA
GCGGAATGGG AAGCGCAGGA CGGTGTTTTG ATTGCTTGGC CCCATGCGGA CAGTGATTGG
CGATCGCTGT TGGATCAGGT CGATCGCGTC TACCGTGACC TAGCGGAAGC AGTGACGCGC
TTTGAGCAGT TGCTGATCGT GACGCCAGAG CCCGATCGCG TGGCAGAACA ACTGGCGCAG
ACCACTGCGA ATTGCGATCG CATTCAGATT GTTGAACTGC CCACCAATGA CACTTGGAGC
CGCGATTTTG GGCCGCTGAC GGTGGAAACC CCAGCAGGGC TGCGACTGCT GGATTGGGGC
TTCAATGGCT GGGGCCTAAA GTTTGCCGCC AACCACGACA ACCAAGTGAC ACGGCGACTT
TGGCAGCAAG GAATTTTTGG CACCACGCCG CTGGAAACTG TGCCGCTGAT TTTTGAAGGG
GGCAGTATCG AAAGTGATGG ACGCGGCACC CTACTCACCA CGAGTCAGTG TTTGCTGGAA
GCGAATCGTA ATCCGGGCTT GAGCCGCGAG GCGATCGCGC AGATCATCCA GCGACAACTG
GGGGGCGATC GCCTGCTCTG GTTGGAGCAT GGCCATTTGG AAGGCGATGA CACGGACGCC
CACATCGACA CGCTGGTGCG GATCGCACCT AATGACACGC TGATTTACGT TGCCTGCGAC
GATCCCAGTG ACAGTCATGC GGCAGAACTA ACAGCCCTTG AAGCAGAACT CAAAGCCTTG
CGCGCGGCGG ACGGACAGCC CTACCACTTG ATTCCCTTGC CTTGGCCGCA ACCCTGCTTC
GATGCGGATG GTCAGCGCTT GCCGACAACC TACGCCAATT ACTTAGTCAT CAATGGCGCA
GTCTTAGTAC CGACTTACAA CGATCCGGCA GATGAGGCGG CGATCGCGGC AATCGCCGCT
GCTTTCCCCG ATCGCCTCGC GATCGGCATC AACTGTCGGC CACTCCTGGA GCAACATGGC
TCACTGCATT GCATCACGAT GCAACTGCCT GCTGGACTTC TCAGTCGCTA A
 
Protein sequence
MRGDFAPNDR TQTTEAARLN KGLFASQIMS QGQQSDRRFP AEWEAQDGVL IAWPHADSDW 
RSLLDQVDRV YRDLAEAVTR FEQLLIVTPE PDRVAEQLAQ TTANCDRIQI VELPTNDTWS
RDFGPLTVET PAGLRLLDWG FNGWGLKFAA NHDNQVTRRL WQQGIFGTTP LETVPLIFEG
GSIESDGRGT LLTTSQCLLE ANRNPGLSRE AIAQIIQRQL GGDRLLWLEH GHLEGDDTDA
HIDTLVRIAP NDTLIYVACD DPSDSHAAEL TALEAELKAL RAADGQPYHL IPLPWPQPCF
DADGQRLPTT YANYLVINGA VLVPTYNDPA DEAAIAAIAA AFPDRLAIGI NCRPLLEQHG
SLHCITMQLP AGLLSR
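
The protein sequence and the GC content field can in principle be re-derived from the gene sequence. The sketch below is one way to do that in plain Python, not the database's own method. It assumes the sequence is pasted with the spaces and line breaks shown above, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because the gene begins with ATG). The helper names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); the amino acid
# assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in tens."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three ten-base groups of the gene sequence listed above.
first_groups = "ATGAGGGGGG ATTTCGCACC GAACGACCGT"
print(translate(first_groups))  # MRGDFAPNDR
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the listed protein sequence and the GC content field.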