Gene Sare_2704 details


Gene Information

Locus tag: Sare_2704
Symbol: cbiD
ID: 5707724
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3077951
End bp: 3079093
Gene Length: 1143 bp
Protein Length: 380 aa
Translation table: 11
GC content: 76%
IMG OID: 641272162
Product: cobalt-precorrin-6A synthase
Protein accession: YP_001537532
Protein GI: 159038279
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1903] Cobalamin biosynthesis protein CbiD
TIGRFAM ID: [TIGR00312] cobalamin biosynthesis protein CbiD
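
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based, inclusive coordinates, and that the gene length includes the terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3077951
END_BP = 3079093
GENE_LENGTH_BP = 1143
PROTEIN_LENGTH_AA = 380


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Residue count for an unspliced CDS, assuming the gene length
    includes one stop codon that is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Here 3079093 - 3077951 + 1 = 1143 bp, and 1143 / 3 = 381 codons, of which one is the stop codon, leaving 380 residues.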


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.197164
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000132557
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGGATACG ACCTGCCGCC GCTGCGCGAG CCGGACCTCC CGCGTACCGC GAAGGTCCGG 
CCCGTCGCGC TGCGCACCGG CTGGACCACC GGCGCCTGCG CCACGGCCGC GGCGAAGGCC
GCGTTGACGG CGCTGGTGAC CGGTGTGGCA CCGGCGGAGG TCGAGATCGG ACTGCCGGCC
GGGCGGCGGG TGCGCTTCCC GGTGGCCCGC TGCGACCGCA GGGACGAGGG CGCCGAGGCG
GTGGTGGTCA AGGACGCCGG CGACGACCCG GACGTCACCC ACGGTGCGGA GCTGACCGCC
ACCGTCGGCT GGCGGCCGGT GCCCGGGCTG GCCCTGGAGG GCGGGCCCGG GGTCGGCACG
GTGACCAAGC CGGGGCTGGG ACTGGCGGTC GGCGGACCGG CGATCAACGA CACTCCGCGC
CGGATGATCG GTGAGGCGGT CGCCGAGGTG GTTGACCTGA CCGCCGTCGG CGTTCGGGTG
GTGATCAGCG TCCCCCGCGG GGAGATCATG GCCCGCAAGA CCACGAACCG CCGGCTCGGC
ATCGTCGGGG GCATCTCGAT CCTGGGTACG ACGGGCATCG TCCGACCGTT CTCCACCGCG
TCCTGGCGGG CCAGCGTCGT GCAGGCGGTG CAGGTGGCGG CCGCCCAGGG GGAACGCACG
GTGGTGCTGT GCACGGGTGG GCGCACCGAG CGGGGCGCCC GGGCGCTGTT GCCGGAACTG
CCGGAGGTGT GCTTTGTGGA GGTCGGCGAC TTCACGGGGG CGGCGGTCAC GGCCGCGGTC
ACCCACGGCC TGTCCGGGGT GGCCTTCGTC GGCATGGCCG GCAAGCTGGC CAAGCTCGCC
GCCGGGGTGC TGATGACCCA CTACACCCGC TCGAAGGTCG ACCTGTCGCT CCTCGGCGCC
GTCACTGCCG AGGCGGGTGG CACCGCCGAC CTGGCCACCG CCGTCACCGC CGCCAACACC
GGTCGGCACG CGTACGAGTT GTGGGAGGCC GCCGGCCTGC TCGGCCCGGC CGGCGACCTG
CTCTGCAGCC GGGTCCGGGC GGTGCTGCGG CGCTTCGCCG GGGATGCCGT CGCCGTCGAC
GTGGCCATGG TCGACTTCAC CGGGGCGCGG GTGGTCGCCT CCTCCGGGCG GTGGGCCCGG
TGA
 
Protein sequence
MGYDLPPLRE PDLPRTAKVR PVALRTGWTT GACATAAAKA ALTALVTGVA PAEVEIGLPA 
GRRVRFPVAR CDRRDEGAEA VVVKDAGDDP DVTHGAELTA TVGWRPVPGL ALEGGPGVGT
VTKPGLGLAV GGPAINDTPR RMIGEAVAEV VDLTAVGVRV VISVPRGEIM ARKTTNRRLG
IVGGISILGT TGIVRPFSTA SWRASVVQAV QVAAAQGERT VVLCTGGRTE RGARALLPEL
PEVCFVEVGD FTGAAVTAAV THGLSGVAFV GMAGKLAKLA AGVLMTHYTR SKVDLSLLGA
VTAEAGGTAD LATAVTAANT GRHAYELWEA AGLLGPAGDL LCSRVRAVLR RFAGDAVAVD
VAMVDFTGAR VVASSGRWAR
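
The sequences above are printed in blocks of 10 characters, 60 per line. The following minimal sketch shows one way to strip that layout and compute a GC fraction for comparison with the GC content field. The helper names are hypothetical, and the database's exact method for computing and rounding GC content is not given in this record.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence from this record: six blocks of ten bases.
first_line = (
    "ATGGGATACG ACCTGCCGCC GCTGCGCGAG CCGGACCTCC CGCGTACCGC GAAGGTCCGG"
)
assert len(clean_sequence(first_line)) == 60
```

Passing the full gene sequence to `clean_sequence` should give a string whose length matches the Gene Length field, and `gc_fraction` on the same input can then be compared with the reported GC content.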