Gene Synpcc7942_1861 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_1861 
Symbol 
ID3775224 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp1931340 
End bp1932605 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content58% 
IMG OID637800302 
Productperiplasmic binding protein of ABC transporter for natural amino acids 
Protein accessionYP_400878 
Protein GI81300670 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCAGC GATCGCGGTC GGCTTGTGTT GCACTGATGT CTACAGTGCT ACTGGTCAGT 
TGTCAAACGC AAGTCCAGTG GCCGTGGCAG GATCCGGGGG GGCTCAAGTT GGGTAGCCTG
CTGCCGCTGA CGGGAGATTT GGCTCAGTAT GGCCGACCCA TGCAAGACAC TGCTGAATTG
CTGGTGCAGA CGGTCAATGC CTGCGGCGGG GTGCAGGGGC TGCCAGTCCG CCTGATTCCA
GCAGACGATG AAACCAAACC CGATCGCGGC GTTGCGGCCA TGACCAAACT GGCGGAAGTC
GATCGGGTAG CAGGGGTGGT CGGCGCAGCG GCCAGCAATG TTTCGGATGC AGCCTTGACC
CTAGCGGTCA ACAACCGTGT GGTGATGATC TCGCCTTCCA GCACCAGTCC TCGTTTTACG
GAACGGGCCC GCCGGGGGGA TTTCAAAGGC TATTGGTTCC GAACGGCTCC CTCCGATGCG
CTGCAAGGGC CAGCCTTGGC CAAACTGGCG CTGGATCAAG GCTGGCGATC GGTCTCGGTC
ATTGCCATCA ATAATGACTA CGGCAATGGG TTGCTGCGAT CGTTCATTCC CGCTTTTGAG
CAGGCCGGCG GAGTGGTGTT TAACCGCGAT CAGCCGGTGC TTTACACGCC TGATGCCAGT
AGCTTTGACA GCGAAGTGGA ACAGGTTTTT CGCGATCGCC CCGATGCTGT GGTGTTGATT
GGCTACCCAG ATTCCGGCGC CCTCATTCTG AAAAGTGCTT ACGAAAAAGG ACTGTTGGGC
CAGTCTACGC AAATGTTGCT GACCGATGGT CTGAAGACTG ATCAATTGGC GGAGCTGGTC
GGACGTAATC CTCAGGGTCG CTACATCGTG CAGGATCTCG TCGGCGTTGC CCCCAGTTCG
GGCGGCCCGG GCCGCGAGGC GTTCCTCAAG CGTTATCAAG AGCGTTTTCA GCGATCGCCG
CAGGTGTTTG ATGCCAATAC TTGGGATGCC GCAGCGCTCT TGGTTCTGGC GGCTGAGAAA
AGTAAGTCCT TGGAAGGGGA AAAGCTCAAG GACAGCGTGG CAGCGATCGC TAATGGTCCG
GGAGAGCCAG TCAGTGATAT CTGTCAGGCC TTGGCCCTAG TGCGCGCGGG CAAACCGATC
AACTATCAAG GGGCCAGTAG CGAGCTCAAG CTGGACAACA ATGGCGATGT TTCTGGGCGC
TATGACTTCT GGCAGTTTGA TGCTGATGGC AAGGTCAAAA TCCTCAAGAC CGAGAGTTTT
CAGTAG
 
Protein sequence
MMQRSRSACV ALMSTVLLVS CQTQVQWPWQ DPGGLKLGSL LPLTGDLAQY GRPMQDTAEL 
LVQTVNACGG VQGLPVRLIP ADDETKPDRG VAAMTKLAEV DRVAGVVGAA ASNVSDAALT
LAVNNRVVMI SPSSTSPRFT ERARRGDFKG YWFRTAPSDA LQGPALAKLA LDQGWRSVSV
IAINNDYGNG LLRSFIPAFE QAGGVVFNRD QPVLYTPDAS SFDSEVEQVF RDRPDAVVLI
GYPDSGALIL KSAYEKGLLG QSTQMLLTDG LKTDQLAELV GRNPQGRYIV QDLVGVAPSS
GGPGREAFLK RYQERFQRSP QVFDANTWDA AALLVLAAEK SKSLEGEKLK DSVAAIANGP
GEPVSDICQA LALVRAGKPI NYQGASSELK LDNNGDVSGR YDFWQFDADG KVKILKTESF
Q