Gene Synpcc7942_1441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_1441 
SymbolcobD 
ID3773613 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp1495174 
End bp1496151 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content59% 
IMG OID637799873 
Productcobalamin biosynthesis protein 
Protein accessionYP_400458 
Protein GI81300250 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1270] Cobalamin biosynthesis protein CobD/CbiB 
TIGRFAM ID[TIGR00380] cobalamin biosynthesis protein CobD 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.282346 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.506208 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGTCTG CGTCACTGAC GACGATCGCG GTCTTGGGGT TGGCGGCCTT GCTGGATTAC 
GGTGTCGGCG ATCCCTGGGG TTGGCCGCAT CCGGTGCAGG CTTTGGGCTG GGTCATTGCT
TGCTGGCGCG ACTGGACGTT TCGCTGGCTG AAATCTGCGA TCGCTCAGCG GATCTCAGGC
ATGGTCCTGA CGATTGTTCT GGTGGCTGGT AGCGCGATCG CTAGCTGGGT TGCTTTTGGG
GCGATCGCTC GTCTCTCACC ACTCCTCTCG GCAGGTCTGC AAGTGATTCT GCTGGCAAGC
TGTTTCGCCG GTCGCAGCTT GCGGGAAGCA GCTGCGGAAG TTCTGAAACC CCTAGCTGCT
GAGGATTTGC CAGCAGCTCG AAGGGCACTG AGTCGCTACG TGGGCCGCGA TACTGATCAG
CTGTCGGCGC TCGAAATTCA GCGAGCGGTG CTGGAAACGG TGACTGAAAA TTCGACGGAT
GGTGTTTTGG CACCACTGTT CTATGCCGGA TTAGGAGTAT TGCTGGGACT TGGCCCTGTT
CCGTTGGCGA TCGCCTATAA GGCTGCCAGC ACCTTGGATT CGATGGTGGG CTACCGCCGC
CCGCCCTACA CGAACCTAGG TTGGTTTCCA GCTCGTAGCG AGGATGTCTG GACTTGGTTG
CCCTGCCGCT TGGTGGTGCT GACGATCGCG CTATTCAGTG GTCAGCCCCG ACAGGTCTGG
CAAATTTGCT GCCGCGATGC TCCGGCGGAT CCCAGTCCCA ATGCAGGCTG GAGCGAAGCG
GCCTACGCAG CTGCGCTGGG GGTTCAAGTC GGCGGCGACA ACGTCTACCA AGGTCAAATC
GTCTCGAAGC CGCTACTGGG GGATCCACAG CGATCGCTGG ATGCCACAGT CATTCAGCAA
GCCTTGCAGT TAACCCGCAT CGCTTTTTTG CTTTGGTTAG CTGTGATCGC GGGACTGCTA
CTAGCGTTGG GGCATTAG
 
Protein sequence
MMSASLTTIA VLGLAALLDY GVGDPWGWPH PVQALGWVIA CWRDWTFRWL KSAIAQRISG 
MVLTIVLVAG SAIASWVAFG AIARLSPLLS AGLQVILLAS CFAGRSLREA AAEVLKPLAA
EDLPAARRAL SRYVGRDTDQ LSALEIQRAV LETVTENSTD GVLAPLFYAG LGVLLGLGPV
PLAIAYKAAS TLDSMVGYRR PPYTNLGWFP ARSEDVWTWL PCRLVVLTIA LFSGQPRQVW
QICCRDAPAD PSPNAGWSEA AYAAALGVQV GGDNVYQGQI VSKPLLGDPQ RSLDATVIQQ
ALQLTRIAFL LWLAVIAGLL LALGH